FOREWORD


On  a  continuing  basis,  the  Army  Mathematics  Steering  Committee  (AMSC) 
sponsors  three  annual  conferences.  These  meetings,  in  the  areas  of  applied 
mathematics,  numerical  analysis  and  statistics,  are  designed  to  promote 
better  communications  among  Army  scientists.  The  oldest  member  of  this 
series,  the  Conference  of  Army  Mathematicians,  held  its  twenty-second 
meeting  at  the  Benet  Weapons  Laboratories,  US  Army  Watervliet  Arsenal, 
Watervliet, New York, on 13-14 May 1976. Dr. Moayyed A. Hussain, the
Chairman on Local Arrangements, took this assignment seriously, and he,
together with other members at Watervliet Arsenal, is due the thanks of
all the attendees for an exceptionally well-planned meeting.

The  ninth  Conference  of  Army  Mathematicians  also  had  as  its  host 
Watervliet  Arsenal.  Statistics  from  these  two  meetings  point  out  some 
of  the  changes  taking  place  in  these  affairs.  The  ninth  Conference  had 
65  attendees,  while  the  present  meeting  entertained  94  persons.  The  1963 
meeting  had  one  invited  speaker  and  24  contributed  papers,  while  the  1976 
Conference  had  6  invited  speakers  and  44  contributed  papers.  The  most 
encouraging  statistic  in  these  figures  is  the  increase  in  the  number  of 
contributed  articles.  While  5  of  the  44  papers  in  this  class  were  given 
by  University  professors,  this  still  leaves  a  sizable  increase  in  the 
number  of  scientific  papers  being  presented  by  Army  scientists. 

The  Subcommittee  on  Applied  Mathematics  of  the  AMSC  has  charge  of  the 
planning  of  the  Conference  of  Army  Mathematicians.  It  selects  invited 
speakers  whose  fields  stress  areas  of  applications  of  mathematics  which 
meet  the  needs  of  the  Army.  It  also  selects  some  speakers  that  address 
fields  which  meet  the  special  interests  of  the  host  installation.  From 
the titles of the addresses of the invited speakers listed below, one may
note that the requirements of the host in the area of fracture mechanics
are stressed in several of these talks.

Nonlocal Elasticity and Fracture Mechanics
Professor A. C. Eringen, Princeton University

Unsteady Problems in Combustion Using Activation Energy Asymptotics
Professor John Buckmaster, University of Illinois

A Return to Input-Output Methods in Statistical Theory
Professor Thomas Kailath, Stanford University

Three-Dimensional Cracks and Weight Functions
Dr. Hans S. Bueckner, General Electric Company

Recent Developments in the Theory of Elasticity and Rupture of Fluid Infiltrated Solids
Professor James Rice, Brown University

In addition to the above speakers, Professor George H. Handelman
of  Rensselaer  Polytechnic  Institute  gave  an  invited  address  at  the 
banquet  which  was  held  on  the  first  evening  of  the  Conference. 

Members  of  the  AMSC  were  pleased  that  representatives  of  the  Air 
Force,  the  Navy,  and  the  Department  of  National  Defence  of  Canada  were 
in attendance at this symposium. They were also pleased to note that the
host installation had 22 of its staff members listening to the presented
papers.

The  last  two  articles  appearing  in  these  Transactions  were  not  given 
at  the  Conference  of  Army  Mathematicians.  These  papers,  one  by  Dr.  Achi 
Brandt  and  the  other  by  Professor  Gene  H.  Golub,  resulted  from  invited 
addresses  delivered  at  the  1976  Army  Numerical  Analysis  and  Computers 
Conference  held  11-12  February  1976  at  the  US  Army  Research  Office. 
THE 22nd CONFERENCE OF ARMY MATHEMATICIANS
Maggs Research Center, Watervliet Arsenal
Watervliet, New York

All general and technical sessions will be held in Rooms 240 and 215, on
the second floor of Maggs Research Center, Bldg. 115, Watervliet Arsenal,
Watervliet, New York

Wednesday, 12 May 1976

0745  BUS FROM HOLIDAY INN TO WATERVLIET ARSENAL

0800-0830  REGISTRATION - RECEPTION LOUNGE, 1st FLOOR, MAGGS RESEARCH CENTER

0830-0845  OPENING OF THE CONFERENCE, WELCOMING REMARKS - ROOM 240

0845-0945  GENERAL SESSION I - ROOM 240

SPEAKER:  Professor A. Cemal Eringen
School of Engineering and Applied Science
Princeton University
Princeton, New Jersey

TITLE:  Nonlocal Elasticity and Fracture Mechanics

CHAIRMAN:  Dr. E. A. Saibel
US Army Research Office
P.O. Box 12211
Research Triangle Park, North Carolina

0945-1000  BREAK
O. L. Bowie and C. E. Freese, Army Materials and
Mechanics Research Center, Watertown, Massachusetts
STATE OF STRESS IN THE NEIGHBORHOOD OF A SHARP CRACK TIP*

A.  Cemal  Eringen 
Princeton  University 


ABSTRACT 

Field equations of nonlocal elasticity are solved to determine the
state of stress in the neighborhood of a line crack in an elastic plate
subject to uniform tension perpendicular to the line of crack at infinity.
It is found that no stress singularity is present at the crack tip. When
the maximum hoop stress is equated to the cohesive stress, the Griffith
criterion of fracture is obtained with the Griffith constant fully determined.
Cohesive stresses necessary to break the atomic bonds are calculated for
Al, Ni, Fe, LiF, Diamond and Zn. The results are in excellent agreement
with those known in the atomic theory of lattices and experiments.


1.  INTRODUCTION 

The determination of the state of stress near the tip of a sharp crack
in an elastic plate subject to uniform tension perpendicular to the line
of crack at infinity, Fig. 1, is one of the most fundamental problems in
fracture mechanics. The solution of this problem was first given by Inglis
[1913] and it was used by Griffith [1920] to establish his celebrated
criterion for fracture of solids. The classical elasticity solution of
this problem gives a hoop stress with a $1/\sqrt{r}$ singularity near the crack tip,
where $r$ is the distance from the crack tip. Thus, according to classical
elasticity the stress is infinite at the crack tip for even a minute
amount of applied tension. Since a plate with a sharp crack possesses a
certain amount of resistance to fracture until the applied tension, $t_0$,
reaches a critical value determined by the so-called Griffith criterion

(1.1)  $t_0^2\,\ell = C_G$

where $\ell$ is the half crack length and $C_G$ is an experimental constant (Griffith
constant), it must be concluded that the classical elasticity solution fails
to apply near the crack tip. This conclusion is responsible for the abandonment
of the maximum stress hypothesis for failure which has been prominent in
structural mechanics. Consequently, for brittle solids, since the time of
Griffith, two distinct fracture criteria have been in use, one for
structural members with no cracks and one for those containing cracks. In
fact, the state of the art is more involved, far beyond this dichotomy, and
many other fracture criteria have been introduced by other authors to overcome
this stress singularity (e.g., J-integral, Barenblatt theory [1962],
Khristianovich [1955]-Dugdale theory [1960], Goodier & Kanninen locally
nonlinear theory [1966], etc.). Below we give a brief discussion of
these theories. A thorough discussion of the status of the art is to be
found in Goodier's [1968] article.


Griffith Criterion. Griffith assumes that the work done to extend a
line crack of length $2\ell$ by an amount $2\delta\ell$ must be equal to the work of
the surface tension. In this way he arrived at the formula (1.1) with

(1.2)  $C_G = \dfrac{2E\gamma}{\pi(1-\nu^2)}$

where $E$ is Young's modulus, $\nu$ is Poisson's ratio for an isotropic
elastic plate and $\gamma$ is the surface tension energy. The surface tension
energy $\gamma$ he employed is that borrowed from fluid statics. In obtaining
(1.1), Inglis' solution for the elliptic hole was used with the
provision that in the limit the minor axis of the ellipse approaches
zero. This theory has been under criticism for over half a century,
nevertheless surviving all criticisms. Basic complaints may be summarized
as:

(i) Crack tip stress is infinite no matter how small the applied load is.

(ii) The crack opens up into an ellipse, so that the shear strain at
the tip is too large ($\pi/4$) for the linear theory to be applicable.

(iii) Ellipse shrinking to a crack may not be "uniform," mathematically,
i.e., other shapes may give different limits.

(iv) The surface tension energy $\gamma$ borrowed from fluid statics may
not be appropriate for solids.
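As a small worked illustration of the Griffith criterion, the sketch below evaluates $C_G = 2E\gamma/\pi(1-\nu^2)$ and the critical tension $t_0 = (C_G/\ell)^{1/2}$ implied by $t_0^2\,\ell = C_G$. The material values are assumed, roughly glass-like numbers chosen only for illustration; they are not taken from the paper, and the function names are hypothetical.

```python
import math

def griffith_constant(E, nu, gamma):
    """Griffith constant C_G = 2 E gamma / (pi (1 - nu^2))."""
    return 2.0 * E * gamma / (math.pi * (1.0 - nu**2))

def critical_tension(E, nu, gamma, half_length):
    """Critical applied tension t_0 from t_0^2 * l = C_G."""
    return math.sqrt(griffith_constant(E, nu, gamma) / half_length)

# Assumed illustrative values (SI units); not data from the paper.
E = 70e9            # Young's modulus, Pa
nu = 0.25           # Poisson's ratio
gamma = 1.0         # surface energy, J/m^2
half_length = 1e-6  # half crack length l, m

t_crit = critical_tension(E, nu, gamma, half_length)
print(f"C_G = {griffith_constant(E, nu, gamma):.3e} Pa^2 m")
print(f"critical tension = {t_crit:.3e} Pa")
```

Quadrupling the crack length halves the critical tension, which is the inverse square-root dependence that any competing criterion has to reproduce.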

Barenblatt Model. To overcome the objections (i) and (ii) Barenblatt [1962]
assumed that the tip region of the crack is not free of tractions but there
exists a "cohesive stress," $\sigma(x)$, distributed in such a way as to bring the
crack tip to a cusp, Fig. 2. He then determined the shape of $\sigma(x)$ to
achieve the cusp form.

Khristianovich-Dugdale Model. Khristianovich [1955] and Dugdale [1960]
assumed that beyond the crack tip over a small length $s$ there is a
constant cohesive stress distribution to close up the ends of the crack,
Fig. 3.


Clearly both Barenblatt and Khristianovich-Dugdale theories are
objectionable for their uses of heuristic assumptions not justifiable on
the basis of any physical principles or experimental work.

Goodier-Kanninen Model. According to Goodier and Kanninen [1966] the
atomic interactions are important at the tip of a crack. In order to
overcome the objection (iii) they use nonlinear springs along the tip
of the crack. The extent of nonlinear springs and their properties are
left to our discretion. While the basic idea of inclusion of long range
interatomic interactions is worthy of careful attention, the model contains
arbitrary factors and functions to be fixed to suit the purpose.
Remarkably common to all these models is the unequivocal realization
that near the crack tip interatomic cohesive forces must be
important.

There exist solutions of Inglis' problem by using polar theories,
e.g., couple stress theory (Sternberg and Muki [1967]), micropolar
theory (Kim and Eringen [1973]). These results also contain the same
type of singularities and therefore no further progress is possible on
these grounds.

Recently we have developed a continuum theory that takes into
account the effect of long range interatomic attractions. According to
this theory the stress at a point of an elastic solid is influenced by
the strains at all points of the body. All known physical and thermodynamic
principles were satisfied (cf. Eringen [1972a,b], Eringen and
Edelen [1972]). When the nonlocal theory is employed for the solution
of the crack tip problem one finds that the stress field at the crack
tip is no longer singular and therefore it is possible to revert back
to the maximum stress hypothesis for fracture criterion. Remarkably
enough this theory not only gives Griffith's criterion without any new
assumption but also determines the Griffith constant. In fact the
cohesive stresses calculated for various materials are in excellent
agreement with those known from the atomic theory of lattices and
experiments. The main purpose of the present paper is an exposition of
these results.


2. BASIC EQUATIONS OF NONLOCAL ELASTICITY

Basic equations of linear, homogeneous, isotropic, nonlocal elastic
solids with vanishing body and inertia forces are (cf. Eringen [1972b])

(2.1)  $t_{kl,k} = 0$

(2.2)  $t_{kl} = \int_V \left[\lambda'(|\mathbf{x}'-\mathbf{x}|)\, e_{rr}(\mathbf{x}')\, \delta_{kl} + 2\mu'(|\mathbf{x}'-\mathbf{x}|)\, e_{kl}(\mathbf{x}')\right] dv(\mathbf{x}')$

(2.3)  $e_{kl} = \tfrac{1}{2}\left(u_{k,l} + u_{l,k}\right)$

where the only difference from classical elasticity is in the stress
constitutive equations (2.2), which state that the stress $t_{kl}(\mathbf{x})$ at a
point $\mathbf{x}$ depends on strains, $e_{kl}(\mathbf{x}')$, at all points of the body. For
homogeneous and isotropic solids the material moduli $\lambda'(|\mathbf{x}'-\mathbf{x}|)$ and
$\mu'(|\mathbf{x}'-\mathbf{x}|)$ are functions of the distance between the points $\mathbf{x}'$ and $\mathbf{x}$.
The integral in (2.2) is over the volume $V$ of the body enclosed within
the surface $\partial V$.

Here and throughout we employ Cartesian tensors with repeated
indices that indicate summation over the range (1,2,3) and indices
following a comma that indicate partial differentiation with respect to
space coordinates, e.g., $u_{k,l} = \partial u_k/\partial x_l$.
In our previous work [1972b, 1974] we have obtained the form of $\lambda'(|\mathbf{x}'-\mathbf{x}|)$
and $\mu'(|\mathbf{x}'-\mathbf{x}|)$ for which the dispersion curves of plane waves coincide
with those obtained in the Born-von Kármán theory of lattice dynamics within
the entire Brillouin zone. Accordingly

(2.4)  $(\lambda', \mu') = (\lambda, \mu)\,\alpha(|\mathbf{x}'-\mathbf{x}|)$,

$\alpha(|\mathbf{x}'-\mathbf{x}|) = \begin{cases} \alpha_0\,(a - |\mathbf{x}'-\mathbf{x}|), & |\mathbf{x}'-\mathbf{x}| \le a \\ 0, & |\mathbf{x}'-\mathbf{x}| > a \end{cases}$

where $a$ is the lattice parameter, $\lambda$ and $\mu$ are the classical Lamé constants and
$\alpha_0$ is a normalization constant to be determined from

(2.5)  $\int_V \alpha(|\mathbf{x}'-\mathbf{x}|)\, dv(\mathbf{x}') = 1$

Since the nonlocal effects are most important along the edge of the crack
we use (2.4) and (2.5) at $\mathbf{x} = 0$ to determine $\alpha_0$. This gives $\alpha_0 = 3/\pi a^3$.
Upon carrying (2.4) into (2.2) we will have


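(2.6)  $t_{kl} = \int_V \alpha(|\mathbf{x}'-\mathbf{x}|)\, \sigma_{kl}(\mathbf{x}')\, dv(\mathbf{x}')$

where

(2.7)  $\sigma_{kl}(\mathbf{x}') = \lambda\, e_{rr}(\mathbf{x}')\, \delta_{kl} + 2\mu\, e_{kl}(\mathbf{x}')$

is the classical Hooke's law.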
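To make the averaging in (2.6) concrete, the following sketch checks numerically that the cone kernel $\alpha_0(a - r)$ with $\alpha_0 = 3/\pi a^3$ integrates to one over a disc of radius $a$ in the plane, and then averages the classical Mode I near-tip stress $\sigma_{yy} = K(2\pi r)^{-1/2}\cos(\theta/2)\,[1 + \sin(\theta/2)\sin(3\theta/2)]$ over a disc centered at the crack tip. This is only an illustration of why a weighted average removes the $r^{-1/2}$ singularity; it is not the calculation carried out in the paper, which solves dual integral equations. The function names, the midpoint quadrature and the unit values of $K$ and $a$ are assumptions.

```python
import math

def alpha(r, a):
    """Cone-shaped kernel of (2.4), normalized in the plane: alpha_0 = 3/(pi a^3)."""
    return 3.0 / (math.pi * a**3) * (a - r) if r <= a else 0.0

def disc_average(f, a, n_r=300, n_theta=300):
    """Midpoint-rule estimate of the integral of alpha(r) * f(r, theta) over the disc r <= a."""
    dr = a / n_r
    dtheta = 2.0 * math.pi / n_theta
    total = 0.0
    for i in range(n_r):
        r = (i + 0.5) * dr
        weight = alpha(r, a) * r * dr * dtheta  # kernel times polar area element
        for j in range(n_theta):
            theta = -math.pi + (j + 0.5) * dtheta
            total += weight * f(r, theta)
    return total

def classical_tip_stress(r, theta, K=1.0):
    """Classical near-tip sigma_yy for a Mode I crack; singular like r**(-1/2)."""
    return (K / math.sqrt(2.0 * math.pi * r)) * math.cos(theta / 2.0) * (
        1.0 + math.sin(theta / 2.0) * math.sin(1.5 * theta))

a = 1.0  # lattice parameter in arbitrary units (assumed)
print("normalization:", disc_average(lambda r, th: 1.0, a))
print("averaged tip stress in units of K / sqrt(a):", disc_average(classical_tip_stress, a))
```

The averaged value is finite and scales like $a^{-1/2}$, so it grows without bound only in the continuum limit $a \to 0$.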


Substituting (2.6) into (2.1) and using the identity

$[\alpha(|\mathbf{x}'-\mathbf{x}|)]_{,k} = -[\alpha(|\mathbf{x}'-\mathbf{x}|)]_{,k'}$

and Green-Gauss theorem we obtain

(2.8)  $-\oint_{\partial V} \alpha(|\mathbf{x}'-\mathbf{x}|)\, \sigma_{kl}(\mathbf{x}')\, da_k(\mathbf{x}') + \int_V \alpha(|\mathbf{x}'-\mathbf{x}|)\, \sigma_{kl,k'}(\mathbf{x}')\, dv(\mathbf{x}') = 0$

Here the surface integral may be dropped if the effect of the surface
tensions is negligible or the body extends to infinity in all directions.
We assume this is the case so that

(2.9)  $\int_V \alpha(|\mathbf{x}'-\mathbf{x}|)\, \sigma_{kl,k'}(\mathbf{x}')\, dv(\mathbf{x}') = 0$

It is not difficult to prove that if $\alpha(|\mathbf{x}'-\mathbf{x}|)$ has a bounded support and
$\sigma_{kl,k}$ is continuous in $V$ then the necessary and sufficient condition for
(2.9) to be satisfied is, cf. Eringen [1976]

(2.10)  $\sigma_{kl,k} = 0$
Equation (2.10) together with (2.7) is none other than Navier's equation
for the displacement field $\mathbf{u}(\mathbf{x})$. From this result it follows that

Theorem. The displacement field of the nonlocal elasticity (under the
conditions stated above) satisfies Navier's equation.

For the displacement boundary-value problem (1st boundary-value problem)
this implies that:

Corollary. The displacement field of the first boundary value problem of
the nonlocal elasticity is identical to that of the classical elasticity.

Note, however, that to obtain the stress field we must substitute $\sigma_{kl}$
obtained from the classical theory into (2.6) and carry out the volume
integration. Thus, for boundary conditions on the tractions we must employ

(2.11)  $t_{kl}\, n_k = t_l$  on $\partial V_t$

that is, on that part of the surface $\partial V$ where the traction $t_l$ is prescribed.


3.  CRACK  PROBLEM 

Consider a plate weakened by a sharp line crack of length $2\ell$. The
plate is subject to a uniform compression $t_0$ at the crack surface and
free of tractions at infinity. The displacement field $u_1 = u(x,y)$,
$u_2 = v(x,y)$ in the upper half plane $y > 0$ is given by the classical
elasticity solution (cf. Sneddon [1951, p. 404]).

(3.1)

$u = \dfrac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} \dfrac{i}{k} \left[ |k|\, A(k) + \left( |k|\, y - \dfrac{\lambda+3\mu}{\lambda+\mu} \right) B(k) \right] \exp(-|k| y - ikx)\, dk$

$v = \dfrac{1}{\sqrt{2\pi}} \int_{-\infty}^{\infty} \left[ A(k) + y\, B(k) \right] \exp(-|k| y - ikx)\, dk$


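where $A(k)$ and $B(k)$ are two functions to be determined from the boundary
conditions at $y = 0$. These conditions are:

(3.2)

$t_{yx} = 0$,  $y = 0$,  $\forall\, x$

$t_{yy} = -t_0$,  $y = 0$,  $|x| < \ell$

$v = 0$,  $y = 0$,  $|x| > \ell$

To obtain the solution of the crack problem with the crack surface free of
tractions and the plate subject to a uniform tension $t_{yy} = t_0$ at $y = \infty$
(Fig. 1), we superimpose a uniform stress field $t_{yy} = t_0$ on the solution
of the above problem.

Substituting (3.1) into (2.3) and (2.7) we calculate:

(3.3)

$\sigma_{yy}(x',y') = -\dfrac{2\mu}{\sqrt{2\pi}} \int_{-\infty}^{\infty} \left[ |k|\, A(k) + \left( |k|\, y' - \dfrac{\mu}{\lambda+\mu} \right) B(k) \right] \exp(-|k| y' - ikx')\, dk$

$\sigma_{yx}(x',y') = -\dfrac{2\mu i}{\sqrt{2\pi}} \int_{-\infty}^{\infty} \left[ k\, A(k) + \left( k\, y' - \dfrac{|k|}{k}\, \dfrac{\lambda+2\mu}{\lambda+\mu} \right) B(k) \right] \exp(-|k| y' - ikx')\, dk$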
According to (2.6) then we have

(3.4)

$t_{yy}(x,y) = \iint \alpha(|\mathbf{x}'-\mathbf{x}|)\, \sigma_{yy}(x',y')\, dx'\, dy'$

$t_{yx}(x,y) = \iint \alpha(|\mathbf{x}'-\mathbf{x}|)\, \sigma_{yx}(x',y')\, dx'\, dy'$

Substituting (3.3) and (2.4) into (3.4) and after carrying out integrations
on $x'$ and $y'$, we set $y = 0$ in these equations and in (3.1)$_2$ to form the
boundary conditions (3.2). As in the classical treatment, (3.2)$_1$ can be used
to determine $B(k)$ in terms of $A(k)$. The process is lengthy and tedious.
We only give the resulting expression

(3.5)  [Expression for $B(k)$ in terms of $A(k)$: a ratio of combinations of powers of $ka$ with $\cos(ka)$, $\sin(ka)$ and $\mathrm{Si}(ka)$, with coefficients involving $\lambda$ and $\mu$. The expression is illegible in the source.]


where $\mathrm{Si}(z)$ is the sine integral defined by

$\mathrm{Si}(z) = \int_0^z \dfrac{\sin t}{t}\, dt$


With $B(k)$ given by (3.5), the boundary condition (3.2)$_1$ is satisfied and
(3.2)$_2$ and (3.2)$_3$ lead to

(3.6)

$(2/\pi)^{1/2} \int_0^\infty A(k) \cos(kx)\, dk = 0$,  $x > \ell$

$(2/\pi)^{1/2} \int_0^\infty k\, \alpha(ka)\, A(k) \cos(kx)\, dk = T_0$,  $x < \ell$


where

(3.7)  $T_0 \equiv t_0\,(\lambda+2\mu)/2\mu(\lambda+\mu)$

[Expression for the kernel $\alpha(ka)$: a combination of powers of $ka$ up to $k^5a^5$ with $\cos(ka)$, $\sin(ka)$ and $\mathrm{Si}(ka)$, with coefficients involving $(\lambda+\mu)/(\lambda+2\mu)$. The expression is illegible in the source.]

The dual integral equations (3.6) must be solved to determine $A(k)$.
When this is done, we will have the problem solved.

It is interesting to note that in the continuum limit $a \to 0$,
$\alpha \to 1$ and (3.6) revert to the dual integral equations obtained in classical
elasticity for the same problem. With a complicated kernel function $\alpha(ka)$
the solution of (3.6) cannot be effected in closed form. However, we
can take advantage of the known classical solution to reduce the problem
to a Fredholm integral equation which is more amenable to numerical
treatment. To this end let $A_c(k)$ denote the solution of the dual integral
equations of the classical theory


(3.8)

$(2/\pi)^{1/2} \int_0^\infty A_c(k) \cos(kx)\, dk = 0$,  $x > \ell$

$(2/\pi)^{1/2} \int_0^\infty k\, A_c(k) \cos(kx)\, dk = T_0$,  $x < \ell$


Subtracting (3.8) from (3.6) we will have

$\int_0^\infty [A(k) - A_c(k)] \cos(kx)\, dk = 0$,  $x > \ell$

$\int_0^\infty k\,[A(k) - A_c(k)] \cos(kx)\, dk = \int_0^\infty k\,[1-\alpha(ka)]\, A(k) \cos(kx)\, dk$,  $x < \ell$


Treating the right hand side of these equations as known, we copy the
solution of these equations from Sneddon [1951, p. 70].

[Expression for $A(k) - A_c(k)$: a sum of repeated integrals of $\zeta\,[1-\alpha(\zeta a)]\,A(\zeta)$ against cosines and Bessel functions, with prefactor $2\ell^2/\pi$. The expression is illegible in the source.]


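where $J_\nu(z)$ are Bessel functions. After carrying out integrations
in $\eta$ and $u$ we obtained the following integral equation of the second kind

(3.9)  $\bar{A}(K) - \int_0^\infty \mathcal{K}(K,\eta)\,[1-\alpha(\eta\varepsilon)]\, \bar{A}(\eta)\, d\eta = \bar{A}_c(K)$

[The kernel, written here as $\mathcal{K}(K,\eta)$, contains the factor $\eta\,(\eta^2-K^2)^{-1}$ multiplying a combination of Bessel functions of $\eta$ and $K$; its exact form is illegible in the source.]

where

$K \equiv k\ell$,  $\eta = \zeta\ell$,  $\varepsilon = a/\ell$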


(3.10)

$\bar{A}(K) \equiv (2/\pi)^{1/2}\, [2\mu(\lambda+\mu)/\ell^2 t_0 (\lambda+2\mu)]\, A(k)$,

$\bar{A}_c(K) \equiv (2/\pi)^{1/2}\, [2\mu(\lambda+\mu)/\ell^2 t_0 (\lambda+2\mu)]\, A_c(k) = J_1(K)/K$

in which the last equality follows from the classical solution for $A_c(k)$
in the case of $t_0$ = const.

When (3.9) is solved for $\bar{A}(K)$ then the displacement and stress fields
follow from (3.1), (3.3) and (3.4). Along the crack line ($y = 0$) these
are given by

(3.11)

$v(\xi,0)\, [2\mu(\lambda+\mu)/(\lambda+2\mu)]/t_0 \ell = \int_0^\infty \bar{A}(K) \cos(K\xi)\, dK$

$t_{yy}(\xi,0)/t_0 = -\int_0^\infty K\, \alpha(K\varepsilon)\, \bar{A}(K) \cos(K\xi)\, dK$

where $\xi \equiv x/\ell$.
The integral equation (3.9) is non-singular for all $\varepsilon \ge 0$. For $\varepsilon = 0$
we have $\bar{A}(K) = \bar{A}_c(K) = J_1(K)/K$. It is also clear that $1-\alpha(K\varepsilon) \to 0$
for small $\varepsilon \ll 1$. The smallest length crack may be constructed by one
missing atom. In this case $a = \ell$ and $\varepsilon = 1$. Thus $0 \le \varepsilon \le 1$. For a
micro-crack of 100 atomic lengths $\varepsilon = 1/50$. It is thus expected that
the contribution of the integral in (3.9) will become appreciable only
for submicroscopic cracks of a few atomic distances. In fact, this turned
out to be the case when (3.9) was solved by means of electronic computers.

The numerical calculations were carried out over two Brillouin
zones, $k = 2\pi/a$, by discretizing (3.9) over 150 grid points. The results
will be reported elsewhere. Here, however, we give some typical cases.

In fact, we have found that the classical solution is perfectly
satisfactory for $2\ell/a \ge 40$ (still a submicroscopic crack).

The stress concentration for the case when the crack surface is
free of traction but the plate is subject to uniform tension $t_{yy} = t_0$
at infinity is given by

(3.12)  $P(x) = [t_{yy}(x,0)/t_0] + 1$

The fact that the classical solution of the dual integral equation (3.8)
satisfies the boundary conditions extremely well for $2\ell/a \ge 40$ can be
seen from Fig. 4. For other details and error estimates depending on $\varepsilon$
the reader is referred to Eringen et al. [1976].


4. COHESIVE STRESS-FRACTURE CRITERION

The stress concentration factor

(4.1)  $C(\nu) = (2\ell/a)^{-1/2}\, P(\ell)$

is shown in Table 1 for various Poisson's ratios $\nu = \lambda/2(\lambda+\mu)$, valid for
$2\ell/a \ge 100$. It is clear that $0.676 \le C(\nu) \le 0.845$. For $\nu = 0.25$,
$C = 0.7713$ for $2\ell/a \ge 100$.
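A short numerical sketch of (4.1), read as $P(\ell) = C(\nu)\,(2\ell/a)^{1/2}$: the tip stress is a finite multiple of the applied tension that grows with the square root of the crack length measured in lattice parameters. The value $C = 0.7713$ is the one quoted above for $\nu = 0.25$; the function name and the sample crack lengths are assumptions made for illustration.

```python
import math

def tip_stress_ratio(C, crack_length_over_a):
    """P(l) = C(nu) * (2l/a)**0.5, i.e. tip stress divided by applied tension, from (4.1)."""
    return C * math.sqrt(crack_length_over_a)

C = 0.7713  # value quoted in the text for nu = 0.25, valid for 2l/a >= 100

for ratio in (100, 1_000, 10_000):  # assumed sample values of 2l/a
    print(f"2l/a = {ratio:>6}: tip stress / applied tension = {tip_stress_ratio(C, ratio):.2f}")
```

Setting the tip stress equal to a cohesive stress then gives a critical applied tension proportional to $(2\ell/a)^{-1/2}$, which is the form of the Griffith criterion.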

By means of (4.1) we make the following very significant observations:

(i) The stress field based on nonlocal theory has no singularity so
long as $a \ne 0$. In the continuum limit $a \to 0$, and the classical square
root singularity occurs.

(ii) A maximum stress hypothesis can now be used to predict the
failure. In fact, we state that: When $t_{yy} = t_c$ = cohesive stress
the fracture will occur. From (4.1) it therefore follows that

(4.2)  $t_0^2\, \ell = [a/2C^2(\nu)]\, t_c^2 = C_G$

This is the Griffith criterion for brittle fracture, with the extra benefit
that the Griffith constant $C_G$ is now fully determined. Interestingly, no
ad hoc constant (e.g., surface energy $\gamma$) occurs in (4.2) and from the
value of $C_G$ it is clear that it is a material property, i.e., it is
known once the cohesive stress $t_c$, lattice parameter $a$, and the Poisson's
ratio $\nu$ are known.
(iii) The verification of the fact that fracture toughness, classically
defined by $K_I \equiv (\pi\ell)^{1/2}\, t_0$, is a material property led many experimentalists
to carry out long and arduous experiments (cf. Freed et al. [1971]; Brown
and Srawley [1966]). If (4.2) is used we see that

(4.3)  $K_I = (\pi C_G)^{1/2} = (\pi a/2)^{1/2}\, t_c/C(\nu)$

is indeed a material property.

(iv) Cohesive stress $t_c$ may be calculated for a given solid by use
of (4.2). Griffith surface energy $\gamma$ appearing in (1.2) has been the
subject of a great deal of experimentation. If we equate (1.2) to (4.2)
we obtain

(4.4)  $t_c^2\, a = K\gamma$

where

(4.5)  $K = 8C^2(\nu)\,\mu/\pi(1-\nu)$

Calculations may now be carried out for various materials. Employing
the experimental values listed in Table 2, we have calculated $t_c/E$ based
on the nonlocal theory. The results are recorded in the next to the last
column of Table 2. The entries in the last column of this table are
the estimates of $t_c/E$ based on atomic considerations (Lawn and Wilshaw
[1975, p. 160]).
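The calculation described above can be sketched as follows, taking (4.4) as $t_c^2\,a = K\gamma$ and (4.5) as $K = 8C^2(\nu)\,\mu/\pi(1-\nu)$ with $\mu = E/2(1+\nu)$, which is the form obtained by equating $C_G = 2E\gamma/\pi(1-\nu^2)$ with $[a/2C^2(\nu)]\,t_c^2$. The material values below are assumed, order-of-magnitude numbers chosen for illustration only; they are not the entries of Table 2, and the function name is hypothetical.

```python
import math

def cohesive_stress_over_E(E, nu, gamma, a, C):
    """Return t_c / E from t_c^2 a = K gamma with K = 8 C^2 mu / (pi (1 - nu))."""
    mu = E / (2.0 * (1.0 + nu))                    # shear modulus
    K = 8.0 * C**2 * mu / (math.pi * (1.0 - nu))   # reconstructed form of (4.5)
    t_c = math.sqrt(K * gamma / a)                 # (4.4)
    return t_c / E

# Assumed illustrative inputs (SI units); not data from the paper.
E = 200e9      # Young's modulus, Pa
nu = 0.25      # Poisson's ratio
gamma = 2.0    # surface energy, J/m^2
a = 3.0e-10    # lattice parameter, m
C = 0.7713     # C(nu) quoted in the text for nu = 0.25

print(f"t_c / E = {cohesive_stress_over_E(E, nu, gamma, a, C):.3f}")
```

Because only the combination $\gamma/a$ enters, the estimate is as sensitive to the measured surface energy as to the lattice parameter.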

The  remarkably  close  values  obtained  should  be  considered  to  be 
indicative  of  the  far  reaching  power  of  the  nonlocal  theory. 
TABLE 1.

Stress Concentration Factor at Crack Tip vs. Poisson's Ratio ($2\ell/a = 100$)

[Table entries are not recoverable from the source.]

Atomic Dist. results are from Table 7.1, p. 160, Lawn and Wilshaw [1975].


FIGURE  1 

Elastic  plate  weakened  by  a  crack 
FIGURE 2


Barenblatt model assumes that a cohesive normal stress $\sigma(x)$
acts at the tip region of the crack surface. The form of
$\sigma(x)$ is to be determined to give cusps at the tips.


FIGURE  3 

Khristianovich-Dugdale Model
ABSTRACT. In the analysis of cracks lying in a compressive stress
field, the classical solution of elasticity frequently yields unacceptable
physical results, often predicting an overlapping of the crack faces. A
first order correction to these solutions can be found by admitting crack
surface interference and searching for a physically compatible displacement
field.

The problem of a center (or edge) cracked strip under in-plane bending
is solved from this viewpoint. A necessary condition for a physically
compatible solution is shown to be the vanishing of the stress intensity factor
at the crack tip in the otherwise compressive field. Numerical results
indicate that the classical solution for the stress intensity factors at the
crack tip in the tensile field underestimates the corrected solution by
approximately ten percent.

1. INTRODUCTION. Every so often the simplifying assumptions of the
classical linear theory of elasticity can lead to mathematical solutions
which are physically unrealistic. We are familiar with the need for retaining
the non-linear terms of the strain-displacement relations to account
for the instability or buckling phenomena observed in the behavior of thin
shells. Another type of deficiency arises in the analysis of configurations
involving cracks lying in compressive stress fields.

A simple example illustrating the subject of this investigation is provided
by a rectangular strip with a central crack loaded by a uniform uniaxial
compression normal to the direction of the crack, Figure 1a. Assuming
no friction across the crack surfaces, the obvious physically acceptable
solution for this problem predicts the tangency of the crack surfaces AOB
and AO'B with a stress state of uniform compression acting throughout the
strip and across the crack surfaces. Compare this solution with that obtained
by reversing the signs of the solution for uniaxial tensile loading, an
assumption consistent with the superposition argument of classical elasticity.
Clearly the resulting infinite compressive stresses at the crack tips and the
negative displacements predicting an overlapping of the crack surfaces
(Figure 1b) arrived at by such an argument are a physically unacceptable
solution of the problem.
Figure 1. Central crack in rectangular strip under uniaxial compression,

=  -  T. 

The "overlapping" problem illustrated above carries over, usually more
subtly, to a variety of crack solutions when a portion of the crack configuration
lies in a stress field which is compressive. A positive symptom of
overlapping in the vicinity of a crack tip can be inferred from the sign of
the stress intensity factor of linear fracture mechanics. If, for example,
$K_I$ (the conventional Mode I stress intensity component) is negative, then
there exists local overlapping at the crack tip.

A  plan  of  modifying  the  classical  solution  by  tolerating  crack  closure 
but  no  overlapping  is  adopted  in  this  paper.  The  problems  corresponding  to 
internal  and  edge  cracks  in  a  strip  under  in-plane  bending  are  analyzed  from 
this  viewpoint  and  the  "error"  in  the  classical  solutions  is  assessed. 

2. CENTRAL CRACK IN AN INFINITE SHEET UNDER BENDING. First, we consider
the problem of a crack of length 2L with center at $z_0$ in an infinite
sheet under in-plane bending (Figure 2). When $z_0 = 0$, the crack is centrally
located with respect to the applied load and crack tips A and B obviously
lie in compressive and tensile stress fields, respectively. We shall now
show that both the classical and the modified solutions of this problem can
be found by the Muskhelishvili [1] method of analysis.
Figure 2. Crack in an infinite sheet under in-plane bending.


The Muskhelishvili analysis depends on the determination of two
analytic stress functions $\phi(z)$ and $\psi(z)$ with the stresses and displacements
defined as

$\sigma_x + \sigma_y = 2[\phi'(z) + \overline{\phi'(z)}]$

$\sigma_y - \sigma_x + 2i\tau_{xy} = 2[\bar{z}\,\phi''(z) + \psi'(z)]$   (1)

$2\mu(u + iv) = \kappa\,\phi(z) - z\,\overline{\phi'(z)} - \overline{\psi(z)}$

where primes denote differentiation and bars complex conjugates. The constants
$\mu$ and $\kappa$ are defined as $\mu = E/2(1+\nu)$ and $\kappa = 3-4\nu$ (plane strain) and
$\kappa = (3-\nu)/(1+\nu)$ (plane stress) where $E$ and $\nu$ are Young's modulus and Poisson's
ratio, respectively.

For a plate with no crack, the stress functions

$\phi(z) = iTz^2/8$,  $\psi(z) = -iTz^2/8$   (2)

yield the stress distribution

$\sigma_y = 0$,  $\tau_{xy} = 0$,  $\sigma_x = -Ty$   (3)

which is of course the desired loading for large $|z|$.
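The stress state for the uncracked plate can be checked directly from the Muskhelishvili relations. The sketch below assumes the stress functions $\phi(z) = iTz^2/8$ and $\psi(z) = -iTz^2/8$, which are the forms implied by the far-field conditions quoted in (6), and evaluates $\sigma_x + \sigma_y = 2[\phi' + \overline{\phi'}]$ and $\sigma_y - \sigma_x + 2i\tau_{xy} = 2[\bar{z}\phi'' + \psi']$ at a sample point. The function name and sample values are illustrative.

```python
def bending_stresses(z, T):
    """Stresses at the point z from phi = i T z^2 / 8 and psi = -i T z^2 / 8."""
    dphi = 1j * T * z / 4.0     # phi'(z)
    d2phi = 1j * T / 4.0        # phi''(z)
    dpsi = -1j * T * z / 4.0    # psi'(z)
    s_sum = 2.0 * (dphi + dphi.conjugate())         # sigma_x + sigma_y
    s_diff = 2.0 * (z.conjugate() * d2phi + dpsi)   # sigma_y - sigma_x + 2 i tau_xy
    sigma_x = (s_sum.real - s_diff.real) / 2.0
    sigma_y = (s_sum.real + s_diff.real) / 2.0
    tau_xy = s_diff.imag / 2.0
    return sigma_x, sigma_y, tau_xy

# Sample point z = x + i y and bending parameter T (assumed values).
sx, sy, txy = bending_stresses(complex(0.3, 2.0), T=5.0)
print(sx, sy, txy)  # sigma_x should equal -T*y, the other two should vanish
```

The only nonzero component is $\sigma_x = -Ty$, a pure in-plane bending field that is compressive for $y > 0$ and tensile for $y < 0$.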

The physical region in Figure 2 can be described conveniently by the
mapping

$z = \omega(\zeta) = z_0 + e^{i\alpha}(L/2)(\zeta + \zeta^{-1})$   (4)

The unit circle and its exterior in the $\zeta$-plane are mapped into the crack
and its exterior in the $z$-plane. In particular, $\zeta = 1$ maps into the crack tip
A and $\zeta = -1$ maps into the crack tip B.

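The stress functions $\phi(z)$ and $\psi(z)$ can now be considered as $\phi(\zeta)$ and $\psi(\zeta)$, where $\phi'(z)$ now corresponds to $\phi'(\zeta)/\omega'(\zeta)$, etc. Using the well-known continuation arguments of Muskhelishvili, the crack is traction-free if we set

$$\psi(\zeta) = -\bar{\phi}(1/\zeta) - \bar{\omega}(1/\zeta)\phi'(\zeta)/\omega'(\zeta) \qquad (5)$$

and the extended definition of $\phi(\zeta)$ leads to a function continuous across the unit circle. On the other hand, from (2) the loading conditions at infinity require

$$\phi(\zeta) \to iTz^2/8 \to iT(L^2/32)[e^{2i\alpha}\zeta^2 + 4(z_0/L)e^{i\alpha}\zeta]$$

$$\psi(\zeta) \to -iTz^2/8 \to -iT(L^2/32)[e^{2i\alpha}\zeta^2 + 4(z_0/L)e^{i\alpha}\zeta] \qquad (6)$$

for large $|\zeta|$.

Conditions (5) and (6) are satisfied by choosing

$$\phi(\zeta) = [iTL^2/32]\{e^{2i\alpha}\zeta^2 + 4(z_0/L)e^{i\alpha}\zeta - [e^{-2i\alpha} - 2]\zeta^{-2} + [8i(\bar{z}_0/L)\sin\alpha + 4(z_0/L)e^{-i\alpha}]\zeta^{-1}\} \qquad (7)$$

and this completes the formal solution.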

3. THE CLASSICAL SOLUTION WHEN $z_0 = 0$. The classical solution for the centrally located crack, $z_0 = 0$, will first be considered. The crack tip A apparently lies in a compressive field and we can anticipate a physical incompatibility of the solution.
The stress intensity at the crack tip A will in general be made up of Modes I and II and can be calculated from

$$K_{1A}^{(1)} - iK_{2A}^{(1)} = 2\phi'(1)[e^{i\alpha}\omega''(1)]^{-1/2} = -T(L^{3/2}/2)\sin^2\alpha\,(\sin\alpha + i\cos\alpha)$$

whence

$$K_{1A}^{(1)} = -(T/2)L^{3/2}\sin^3\alpha \qquad (8)$$

$$K_{2A}^{(1)} = (T/2)L^{3/2}\sin^2\alpha\cos\alpha \qquad (9)$$

Similarly, at crack tip B (corresponding to $\zeta = -1$),

$$K_{1B}^{(1)} = (T/2)L^{3/2}\sin^3\alpha$$

$$K_{2B}^{(1)} = -(T/2)L^{3/2}\sin^2\alpha\cos\alpha \qquad (10)$$

A clue to the unacceptability of the solution is the negativeness of $K_{1A}^{(1)}$.

In order to examine the physical compatibility of the displacements of the crack surfaces, we introduce a $(\xi,\eta)$ coordinate system where $\xi$ and $\eta$ are along and normal to, respectively, the crack direction. Then

$$u_\xi + iu_\eta = e^{-i\alpha}(u + iv) = e^{-i\alpha}(\kappa + 1)\phi(\sigma)/2\mu \qquad (11)$$

for the crack boundary, where $\sigma = e^{i\theta}$ are points on the unit circle in the $\zeta$-plane. The condition for no "overlapping" of the crack boundaries can be written as

$$u_\eta(\theta) - u_\eta(-\theta) \geq 0, \qquad 0 \leq \theta \leq \pi \qquad (12)$$

When $z_0 = 0$,

$$u_\eta(\theta) - u_\eta(-\theta) = (\kappa + 1)TL^2\sin\alpha\,\sin 2\theta\,[\cos 2\alpha - 1]/16\mu \qquad (13)$$

which (except for the trivial cases $\alpha = 0, \pi$) clearly violates the no-overlapping condition (12) in the interval $0 < \theta < \pi/2$.
4. DETERMINATION OF A PHYSICALLY ACCEPTABLE SOLUTION. The plan for determining a physically acceptable solution depends on admitting crack closure over segments of the crack without overlapping. If the crack tip is involved in the region of overlapping, as is the case in the present problem, a necessary condition for an acceptable solution can be expressed in terms of the stress intensity factors from a consideration of the local stress and displacement fields.

Consider the crack tip A and the displacements $u_\xi$ and $u_\eta$ to the first order of the local crack tip expansion. A necessary condition for no local overlapping, $K_I \geq 0$, can easily be shown from a consideration of $u_\eta$. If, in addition, we assume crack closure in the neighborhood of A, then $\sigma_\eta$ must be non-tensile across this interval. Therefore, a necessary condition for a physically acceptable solution is $K_I = 0$ at A. No claim as to the sufficiency of this condition can be made, as the stress intensity reflects only the dominant term of the local solution. A solution arrived at on this basis must still be tested for its overall consistency.

In the present case, we consider $z_0$ as undetermined and impose the vanishing of $K_I$ at A. Since

$$\phi'(1) = -iT(L^2/4)\{\sin^2\alpha + (2/L)(\sin\alpha)\,\mathrm{Im}\,z_0\} \qquad (14)$$

it follows that $K_I = 0$ at A if we choose

$$\mathrm{Im}\,z_0 = -(L/2)\sin\alpha \qquad (15)$$

Although there are no restrictions on $\mathrm{Re}\,z_0$, we choose $z_0$ so that the crack passes through the origin of coordinates, thus

$$z_0 = -(L/2)e^{i\alpha} \qquad (16)$$

With this choice of $z_0$ we reexamine the non-overlapping condition (12). On the crack,

$$u_\eta = (\kappa + 1)TL^2\{4\sin^3\alpha\,\sin\theta(1 - \cos\theta) + \cos\alpha(1 + 2\sin^2\alpha)(1 - 2\sin^2\theta - 2\cos\theta)\}/32\mu \qquad (17)$$

thus,

$$u_\eta(\theta) - u_\eta(-\theta) = (\kappa + 1)TL^2\sin^3\alpha\,\sin\theta(1 - \cos\theta)/4\mu \qquad (18)$$

which clearly satisfies (12) for $0 \leq \theta \leq \pi$ and hence is a physically acceptable displacement field.
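The overlap argument can be checked numerically. The sketch below is an illustration under stated assumptions, not part of the original analysis: it takes the stress function in the form $\phi(\zeta) = (iTL^2/32)\{e^{2i\alpha}\zeta^2 + 4(z_0/L)e^{i\alpha}\zeta - (e^{-2i\alpha} - 2)\zeta^{-2} + [8i(\bar{z}_0/L)\sin\alpha + 4(z_0/L)e^{-i\alpha}]\zeta^{-1}\}$, evaluates the opening $u_\eta(\theta) - u_\eta(-\theta)$ from (11) on the unit circle, and compares the centered crack, $z_0 = 0$, with the shifted crack of (16). The function names and the sample values of T, L, $\alpha$, $\kappa$ and $\mu$ are hypothetical.

```python
import cmath
import math


def phi(zeta, T, L, alpha, z0):
    """Stress function in the mapped zeta-plane (assumed form of Equation (7))."""
    z0 = complex(z0)
    c = 1j * T * L**2 / 32
    e = cmath.exp(1j * alpha)
    return c * (e**2 * zeta**2
                + 4 * (z0 / L) * e * zeta
                - (1 / e**2 - 2) / zeta**2
                + (8j * (z0.conjugate() / L) * math.sin(alpha)
                   + 4 * (z0 / L) / e) / zeta)


def opening(theta, T, L, alpha, z0, kappa, mu):
    """u_eta(theta) - u_eta(-theta), using Equation (11) on the unit circle."""
    s = cmath.exp(1j * theta)
    jump = phi(s, T, L, alpha, z0) - phi(s.conjugate(), T, L, alpha, z0)
    return (cmath.exp(-1j * alpha) * (kappa + 1) * jump / (2 * mu)).imag


# Illustrative values only.
T, L, alpha, kappa, mu = 1.0, 2.0, 0.7, 1.8, 1.0
z0_modified = -(L / 2) * cmath.exp(1j * alpha)      # Equation (16)
thetas = [k * math.pi / 24 for k in range(25)]      # 0 .. pi

classical = [opening(t, T, L, alpha, 0.0, kappa, mu) for t in thetas]
modified = [opening(t, T, L, alpha, z0_modified, kappa, mu) for t in thetas]

print("classical solution overlaps:", min(classical) < 0)
print("modified solution overlaps: ", min(modified) < -1e-12)
```

For the centered crack the opening is negative for $0 < \theta < \pi/2$, as (13) predicts; for the shifted crack it is non-negative on the whole interval, in agreement with (18).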

The  stress  intensity  factors  in  the  present  case  are 
$$K_{1A}^{(2)} = 0$$

$$K_{1B}^{(2)} = TL^{3/2}\sin^3\alpha \qquad (19)$$

$$K_{2B}^{(2)} = -TL^{3/2}\sin^2\alpha\cos\alpha$$

Furthermore,  it  is  easily  verified  that  the  forces  normal  to  the  segment  AC 
in  Figure  2  are  compressive. 

It is clear that the present solution can be considered as a physically acceptable solution for a central crack along the segment BC where closure occurs on the segment AC. We do assume, of course, that the frictional properties of the crack surfaces are consistent with a continuous displacement solution along AC, i.e. closure without slippage.

A comparison with the previously derived classical solution for a centrally located crack can now be made by observing the change in the stress intensity calculation at point B. The crack BC corresponds to a crack length of 2L if an effective half crack length of 2L/3 is used in the calculation of (19). Thus, the "corrected" stress intensity factors at B are

$$K_{1B}^{(2)} = T(2L/3)^{3/2}\sin^3\alpha$$

$$K_{2B}^{(2)} = -T(2L/3)^{3/2}\sin^2\alpha\cos\alpha \qquad (20)$$

Since

$$K_{1B}^{(2)}/K_{1B}^{(1)} = 2(2/3)^{3/2} \qquad (21)$$

the classical estimate of the stress intensity factor at B is in error on the non-conservative side by approximately nine percent.
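The nine percent figure follows from simple arithmetic. As a minimal sketch, the lines below compare the Mode I factors at B, both normalized by $TL^{3/2}\sin^3\alpha$: the classical value is taken as 1/2 and the corrected value as $(2/3)^{3/2}$. The variable names are illustrative.

```python
# Mode I stress intensity at B, normalized by T * L**1.5 * sin(alpha)**3.
classical = 0.5                    # classical centered-crack solution
corrected = (2.0 / 3.0) ** 1.5     # closure-corrected solution, half length 2L/3

ratio = corrected / classical      # equals 2 * (2/3)**1.5
print(f"corrected / classical = {ratio:.4f}")
```

The ratio is about 1.089, so the classical value falls short of the corrected one by roughly nine percent of the classical value.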


5. CENTRAL CRACK IN A FINITE STRIP UNDER BENDING. We consider, now, the more difficult problem of a central crack in a strip of finite width under bending, Figure 3, where the solution cannot be found in closed form and the previous arguments must be carried out numerically. For the configuration in Figure 3, Benthem and Koiter [2] have estimated the crack tip stress intensity factors at B for the classical solution of the problem by using an effective asymptotic argument.

Figure 3. Central crack in a strip under bending.

The solution was carried out using the MMC (Modified Mapping Collocation) method combined with finite elements [3,4]. This plan is based on "partitioning" the region and using a representation of the solution appropriate to each sub-region. The boundary conditions along with appropriate "stitching" conditions between the representations must be satisfied by the solution. The details of this approach have been previously documented and will not be repeated here.

The partitioning plan is indicated in Figure 3. The region MRSN was described using the mapping function

$$z = z_0 + i(\ell/2)(\zeta + 1/\zeta) \qquad (22)$$

which clearly maps the unit circle in the $\zeta$-plane into the crack AB. A series representation of the solution was chosen in the corresponding parameter region and traction-free conditions on the crack were enforced by the continuation argument, e.g., Equation (5). The boundary conditions on RS and MN and the stitching conditions on RM and SN were imposed by the collocation arguments outlined in [4]. In the complementary regions (the shaded areas in Figure 3) a finite element representation of the solution was taken. Imposed on this representation were the appropriate stitching conditions, traction-free boundary conditions and end loading,

$$\sigma_x = (3M/2b^3)y \qquad (23)$$


Again, we seek a value of $z_0$ such that the stress intensity $K_{IA} = 0$ and the crack displacements and the forces on AC are physically compatible with our argument. It was found that $z_0$ can be determined quite readily by iteration. From the infinite sheet solution, it is evident that for small $\ell/b$ ratios, $z_0 \approx -\ell/2$. With this as a guide for the first approximation, only a few trials were required to find the proper value of $z_0$ for successively increasing values of $\ell/b$.

The numerical results are presented in Table 1. Again the results are to be compared with the classical solution for the central crack, $z_0 = 0$. The effective half crack length, L, is evidently

$$L = \ell + |z_0| \qquad (24)$$

Table 1

"Corrected" Stress Intensity Factors, $K_{IB}^{(2)}$, for Central Crack in Strip under Bending

L/b    l/b      z_0/b     OA/b     K_IB^(2)/(M b^(-3/2))   K_IB^(1)/(M b^(-3/2))   K_IB^(2)/K_IB^(1)
0.1    0.067    -0.033    0.033    0.0259                  0.0237                  1.09
0.2    0.133    -0.067    0.067    0.0733                  0.0672                  1.09
0.3    0.200    -0.100    0.100    0.136                   0.124                   1.10
0.4    0.270    -0.130    0.140    0.213                   0.193                   1.10
0.5    0.340    -0.160    0.180    0.304                   0.276                   1.10
0.6    0.414    -0.186    0.228    0.417                   0.379                   1.10
0.7    0.492    -0.208    0.284    0.567                   0.516                   1.10
0.8    0.574    -0.226    0.350    0.796                   0.727                   1.10
0.9    0.668    -0.232    0.432    1.280                   1.163                   1.10
1.0    0.763*   -0.237*   0.526*

*Extrapolated

It is interesting to compare the present results with the classical results $K_{IB}^{(1)}$ of Benthem and Koiter. Within one percent, the classical solution underestimates the $K_{IB}$ values by nine percent for all values of L/b.

6. MODIFICATION OF THE ASYMPTOTIC APPROXIMATION. In [2], Benthem and Koiter introduced a non-dimensional factor K by writing

$$K_I = K\,3LM(aL/b)^{1/2}/2(b^3 - L^3) \qquad (25)$$

where K is a polynomial in L/b. An approximate solution for K was found by order of magnitude considerations of the two limiting cases, $L/b \to 0$ and $a/b \to 0$.

The modifications of their arguments for the "corrected" solution for $L/b \to 0$ can now be carried out by using our solution for the central crack in an infinite sheet. In particular, if the order of magnitude considerations of [2] are modified by Equation (20), then, at the crack tip B,

$$K_{IB} \to (2/3)^{3/2}[3ML^{3/2}/2b^3][1 + O(L^4/b^4)] \quad \text{for } L/b \to 0 \qquad (26)$$

From a comparison of Equations (25) and (26),

$$K \to (2/3)^{3/2}[1 + (1/2)(L/b) + (3/8)(L/b)^2 - (11/16)(L/b)^3 + O(L^4/b^4)] \quad \text{for } L/b \to 0 \qquad (27)$$

For the second limiting case, $a/b \to 0$, by using the anti-symmetry of the classical problem and the "edge dam" solution, the authors of [2] found

$$K \to 2/(\pi^2 - 4)^{1/2} = 0.826 \quad \text{for } a/b \to 0 \qquad (28)$$

Unfortunately, due to the non-linearity of our present solution no such limit can be rigorously argued. On the other hand, a reasonable estimate of this limit can be found by extrapolation of the data. From Table 1, the segment OA can be extrapolated as $OA \to 0.52b$ as $a/b \to 0$. Furthermore, the stress distribution $\sigma$ along the centerline from A to the edge is very nearly linear. From equilibrium conditions, it can be argued that the local stress at B is nine percent higher than in the classical case. Thus,

$$K \to (1.09)(0.826) \quad \text{for } a/b \to 0 \qquad (29)$$

(Although (29) is an extrapolated estimate, it was verified that reasonable variations in the approximation altered this result by no more than one percent.)


Therefore, the simplest polynomial interpolation between these asymptotic results yields

$$K = (2/3)^{3/2}[1 + (1/2)(L/b) + (3/8)(L/b)^2 - (11/16)(L/b)^3 + .464(L/b)^4] \qquad (30)$$

Equation (30) is identical with the K in [2] if $(2/3)^{3/2}$ were replaced by 1/2.
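The interpolation can be checked against the two limits it is meant to join. The following sketch, with illustrative names, evaluates the polynomial at L/b = 1 using the leading factor $(2/3)^{3/2}$ for the corrected solution and 1/2 for the classical one, and compares the results with $(1.09)(0.826)$ and $2/(\pi^2 - 4)^{1/2}$, respectively.

```python
import math


def k_factor(x, lead):
    """Interpolating polynomial in x = L/b with leading factor `lead`."""
    return lead * (1 + x / 2 + 3 * x**2 / 8 - 11 * x**3 / 16 + 0.464 * x**4)


corrected_limit = k_factor(1.0, (2.0 / 3.0) ** 1.5)   # deep-crack limit, corrected
classical_limit = k_factor(1.0, 0.5)                  # deep-crack limit, classical

print(f"corrected: {corrected_limit:.4f}  target: {1.09 * 0.826:.4f}")
print(f"classical: {classical_limit:.4f}  target: {2 / math.sqrt(math.pi**2 - 4):.4f}")
```

Both limits are reproduced to within a fraction of one percent, which is consistent with the coefficient .464 having been fitted to the classical limit and carried over unchanged.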

7. EDGE CRACK IN A STRIP UNDER BENDING. At about the same time as the author's solution [5], Paris and Tada [6] considered the solution for an edge crack in a strip under bending, again allowing for interference of segments of the crack surfaces, Figure 4.


Figure  4.  Edge  crack  in  strip  under  bending. 

It is obvious physically that for $C/W \leq 1/2$, assuming no friction between the crack surfaces, the admissible solution is one which predicts the strip is in uniform bending with the crack surfaces interfering and carrying a compressive load. For C/W > 1/2, it is also clear that the solution is identical to our results for the central crack with a modified interpretation of the parameters.

In Paris and Tada's analysis, the crack tip stress intensity, $\tilde{K}$, was approximated by

$$\tilde{K}/\sigma\sqrt{\pi C} = G(C/W)H(C/W) \qquad (31)$$

where

$$G(C/W) = (2/3)^{3/2}(2C/W)(1 - W/2C)^{3/2} \qquad (32)$$

based on the solution for a central crack in an infinite sheet under bending. (Note that in Equation (31), the alternate introduction of $\sqrt{\pi}$ in the definition of stress intensity factors has been made.) The function H(C/W) was taken as the correction for the effect of the finite width of the strip. Paris and Tada did not calculate H(C/W) exactly; instead, they assumed an approximation based on the finite width correction for a center cracked finite width strip under tension. Their numerical results are listed in Table 2.

The  results  which  we  have  derived  can  be  applied  with  the  following 
changes  in  notation. 


L  =  a^  =  C  -  W/2 

b  =  W/2 
a  =  b  -  L 

$L/b = 2(C/W) - 1 = \lambda$
$M = (2/3)b^2\sigma$
$\sqrt{\pi}\,K_{IB} = \tilde{K}$


(33) 


Then,

$$\tilde{K}/\sigma\sqrt{\pi C} = R(\lambda)\lambda^{3/2}\sqrt{1 - \lambda}/[(1 - \lambda^3)\sqrt{1 + \lambda}] \qquad (34)$$

where

$$R(\lambda) = (2/3)^{3/2}[1 + \lambda/2 + 3\lambda^2/8 - 11\lambda^3/16 + .464\lambda^4]$$

A comparison of the results is shown in Table 2.

Table 2. Values of $\tilde{K}/\sigma\sqrt{\pi C}$

C/W     lambda   Equation (31)   Equation (34)
0.50    0.0      0.0000          0.0000
0.55    0.1      0.0165          0.0164
0.60    0.2      0.0453          0.0445
0.70    0.4      0.129           0.118
0.80    0.6      0.259           0.217
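The Equation (34) column of Table 2 can be regenerated with a few lines of Python. The sketch below assumes the normalized form $R(\lambda)\lambda^{3/2}\sqrt{1-\lambda}/[(1-\lambda^3)\sqrt{1+\lambda}]$ with $\lambda = 2(C/W) - 1$ and $R(\lambda) = (2/3)^{3/2}[1 + \lambda/2 + 3\lambda^2/8 - 11\lambda^3/16 + .464\lambda^4]$; the factor $(1 - \lambda^3)$ in the denominator is an assumption of this sketch, adopted because it reproduces the tabulated values. The function names are illustrative.

```python
import math


def r_poly(lam):
    """Interpolating polynomial R(lambda) with the corrected leading factor."""
    return (2.0 / 3.0) ** 1.5 * (
        1 + lam / 2 + 3 * lam**2 / 8 - 11 * lam**3 / 16 + 0.464 * lam**4
    )


def k_normalized(c_over_w):
    """Normalized stress intensity for an edge crack of relative depth C/W >= 1/2."""
    lam = 2.0 * c_over_w - 1.0
    return (r_poly(lam) * lam**1.5 * math.sqrt(1.0 - lam)
            / ((1.0 - lam**3) * math.sqrt(1.0 + lam)))


for c_over_w in (0.50, 0.55, 0.60, 0.70, 0.80):
    print(f"C/W = {c_over_w:.2f}   K = {k_normalized(c_over_w):.4f}")
```

The computed values agree with the last column of Table 2 to the number of digits printed there.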
The approximation of H(C/W) used by Paris and Tada appears to exaggerate the stress intensity $\tilde{K}$ for the deeper cracks. For cyclic bending of an edge cracked strip, the moment M contributes to the crack opening after the crack has reached the half width of the strip. The $\tilde{K}$ values for further crack growth can then be determined from Equation (34).

8.  OBSERVATIONS.  The  problem  of  "crack  overlapping"  occurs  in  several 
of  the  classical  analyses  found  in  the  literature.  The  results  of  this 
investigation  would  appear  to  indicate  that  the  errors  so  introduced  are 
sufficient  to  warrant  a  more  careful  consideration  of  such  solutions. 
SUMMARY

A finite element formulation is described for problems with solution functions known to have local $r^\lambda$ variation(s), $0 < \lambda < 1$, and thus singular gradients. Special 3-node triangular elements encircle the singularity and focus to share a common node at the singular point. The shape function of each triangle has the appropriate $r^\lambda$ mode and a smooth angular mode expressed in element natural coordinates. As with standard elements, the unknowns are the nodal values of the function. Even if the precise angular form of the asymptotic solution is known, the formulation makes no attempt to embed it, but instead piecewise approximates it. This allows assembly of the element coefficient matrix using standard procedures without nodeless variables and bandwidth complications.

The  conditions  of  continuity,  low  order  solution  capability,  and  accurate 
numerical  integration  of  the  singularity  element  are  discussed  with  a  view 
towards  establishing  the  general  range  of  applicability  of  the  formulation. 
Numerical  applications  to  the  elastic  fracture  mechanics  problems  of  composite 
bondline  cracking  and  crack  branching  are  discussed. 

INTRODUCTION 

We are considering here the problem of attaining accurate numerical representation of a function $\phi(x,y)$ when near discrete points in the domain $\phi$ varies as $r^\lambda$, $0 < \lambda < 1$. Standard shape functions cannot properly model the singular gradient of $r^\lambda$, so our approach has been to design a special singularity element. Beyond embedding the proper singularity into the shape function, the usual questions of interelement continuity, constant state representation, and accurate numerical integration are addressed.

Interelement continuity should be maintained for $\phi$ and its derivatives up to one order less than that occurring in the governing volume integral, denoted by I, of the problem. Subsequently, it will be shown that the singularity element has $\phi$ interelement continuity but no guaranteed continuity of $\phi$ gradients across edges. Strictly speaking then, it is limited to problems where $I = I(\phi, \partial\phi/\partial x_i)$. For example, this is the case in the potential energy formulation of elasticity where the governing functional of displacement involves only first order derivatives. The virtual work formulation of plasticity is another such case with $\phi$ representing the displacement increment.

The other finite element convergence criterion is that an element should be capable of representing fields with constant values of $\phi$, or derivatives of $\phi$ up to the order occurring in I. This is necessary because in the limit of vanishing element size, $\phi$ and its derivatives should, within the element, equal the pointwise constant values. From a practical standpoint the constancy conditions are important only when constant state conditions exist over the finite subdomain occupied by the element. The boundary conditions of a singularity problem can cause smooth as well as singular $\phi$ variations near the singular point. The constancy capability of the elements at the singular point is important only if the smooth terms are, on an element average basis, comparable in value to the singular terms. The element introduced below has $\phi$ modes of the constant and $r^\lambda$ type. It does not have the polynomial terms necessary to represent non-zero constant derivatives. Since the singular mode dominates the uniform mode as the singularity is approached, the lack of the latter mode is of diminishing consequence as element size is reduced, and thus convergence is achievable in this sense. However, it is clear that the element is not suited for problems without an "active" singularity.

FORMULATION

The element described here is a generalization of the singular element suggested for analysis of the elastic crack tip singularity. The element is a 3-node triangle, and has one of its nodes at the singular point. The power form variation is chosen in the direction away from the singular point; low order smooth variation is chosen in the angular direction. Figure 1a illustrates the modeling with one of a necessary group of triangles at the singular point, node I. The shape function is developed in terms of the oblique coordinates $\xi$, $\eta$ which vary over the range [0, 1] within the element. The radial edges correspond to $\eta = 0, 1$. The edge $\xi = 0$ is actually a point, the singular point, and the far transverse edge is $\xi = 1$. The transformation to cartesian coordinates follows

$$\mathbf{x} = \mathbf{x}_I(1 - \xi) + \mathbf{x}_J\,\xi(1 - \eta) + \mathbf{x}_K\,\xi\eta \qquad (1)$$

It is straightforward to show that $\xi$ is always a linear function of r times a trigonometric function of angular orientation within the element and that $\eta$ is solely a trigonometric function of angle. As an example, the isosceles triangle of Fig. 1b has the transformation equations

$$\xi = (r\cos\theta)/x_0 = x/x_0$$

$$\eta = (\tan\theta/\tan\alpha + 1)/2 = (y/x \cdot x_0/y_0 + 1)/2 \qquad (2)$$

With $\xi$ being a linear function of r, $\phi$ varies as $r^\lambda$ when $\xi^\lambda$ terms are chosen in the shape functions; such a choice yields the interpolation function

$$\phi = \phi_I(1 - \xi^\lambda) + \phi_J\,\xi^\lambda(1 - \eta) + \phi_K\,\xi^\lambda\eta \qquad (3)$$

For the isosceles triangle this corresponds to

$$\phi = \phi_I(1 - (x/x_0)^\lambda) + 1/2\,\phi_J(1 - y/x \cdot x_0/y_0)(x/x_0)^\lambda + 1/2\,\phi_K(1 + y/x \cdot x_0/y_0)(x/x_0)^\lambda \qquad (4)$$

By  using  a  group  of  these  elements  about  the  singularity,  the 
angular  form  of  the  asymptotic  solution  is  approximated  in  a  piecewise 
smooth  fashion.  The  singular  radial  variation  is  embedded  throughout 

the  region  occupied  by  the  elements. 
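A minimal Python sketch of the element interpolation may help fix ideas. It assumes the coordinate map $\mathbf{x} = \mathbf{x}_I(1-\xi) + \mathbf{x}_J\xi(1-\eta) + \mathbf{x}_K\xi\eta$ and the interpolation $\phi = \phi_I(1-\xi^\lambda) + \phi_J\xi^\lambda(1-\eta) + \phi_K\xi^\lambda\eta$, with node I at the singular point; the function names and the sample triangle are hypothetical.

```python
def to_cartesian(xi, eta, p_i, p_j, p_k):
    """Map oblique coordinates (xi, eta) in [0, 1] x [0, 1] to a point (x, y)."""
    return tuple(
        a * (1 - xi) + b * xi * (1 - eta) + c * xi * eta
        for a, b, c in zip(p_i, p_j, p_k)
    )


def interpolate(xi, eta, lam, f_i, f_j, f_k):
    """Nodal interpolation with an embedded xi**lam radial mode (0 < lam < 1)."""
    s = xi**lam
    return f_i * (1 - s) + f_j * s * (1 - eta) + f_k * s * eta


# Isosceles triangle with the singular node I at the origin (illustrative).
p_i, p_j, p_k = (0.0, 0.0), (1.0, -0.5), (1.0, 0.5)
print(to_cartesian(0.5, 0.5, p_i, p_j, p_k))        # point on the centerline
print(interpolate(0.25, 0.0, 0.5, 0.0, 1.0, 1.0))   # sqrt(0.25) on edge IJ
print(interpolate(0.3, 0.6, 0.5, 2.0, 2.0, 2.0))    # constant field is reproduced
```

Because the three weights sum to one for any $(\xi, \eta)$, equal nodal values are returned unchanged, while along a radial edge the field varies as $\xi^\lambda$ between the two end nodes.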

On the radial edges $\phi$ is a two parameter function, e.g. on IJ

$$\phi = \phi_I(1 - \xi^\lambda) + \phi_J\,\xi^\lambda \qquad (5)$$

so that there is continuity of $\phi$ on these edges. On JK $\phi$ is a linear function of position, which guarantees continuity with an element such as the bilinear isoparametric. Derivative continuity across element edges is not guaranteed so that, as previously discussed, the element strictly applies only to those problems whose governing integrals are independent of second and higher order $\phi$ derivatives.

The element is capable of representing a constant $\phi$ condition, as can be seen by substituting a constant for the nodal values in the interpolation function and observing that $\phi$ then equals the constant. Without a linear term in the shape function the constant first derivative condition cannot be met. In analysis of deformable solids where $\phi$ would be the displacement function, situations such as rigid rotation and uniform thermal expansion correspond to a linear mode. The element cannot directly accommodate these cases, but by choosing a small enough element the singular mode will dominate the exact solution, making the exclusion of the linear mode inconsequential.

The singular nature of the $\phi$ gradients does not preclude the possibility of accurate numerical integration in forming the coefficient matrix. It is assumed from the outset that the $r^\lambda$ variation gives rise to an integrable singularity. Standard methods of integration have been developed for polynomial variations so that these can be used only for the angular integration. In general the problem is to integrate terms of the form

$$\int_0^1 f(\xi)\,\xi\,d\xi \int_0^1 g(\eta)\,d\eta \qquad (6)$$

The determinant of the Jacobian, $\partial(x,y)/\partial(\xi,\eta)$, accounts for the factor $\xi$ of the inner integrand. For the examples below a 2-point Gauss rule was used for the $\eta$ integration. The form of $f(\xi)$ must be scrutinized before choosing a $\xi$ integration rule. For elasticity the governing integral is a quadratic function of the shape function first derivatives and this results in

$$f(\xi) = \xi^{2\lambda - 2} \qquad (7)$$

Hence

$$\int_0^1 f\,\xi\,d\xi = \int_0^1 \xi^{2\lambda - 1}\,d\xi = 1/2\lambda \qquad (8)$$

For the elasticity examples below, the numerical technique employed to achieve precisely the result (8) was a specialized 1-point rule: one integration station was used at location $\xi = (2\lambda)^{1/(1 - 2\lambda)}$ and its weight was unity. It is easily appreciated that standard methods of integration can be very much in error for this problem, particularly for $\lambda < 0.5$. Hence, generally speaking, detailed investigation of $f(\xi)$ is required for design of an adequate integration procedure.
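The effect of the integration rule is easy to demonstrate. The sketch below assumes the radial integrand is $\xi^{2\lambda-1}$, with exact integral $1/2\lambda$ on [0, 1], and that the specialized rule places a single station of unit weight at $\xi = (2\lambda)^{1/(1-2\lambda)}$; it compares that rule with a standard 2-point Gauss rule. The function names are illustrative, and $\lambda = 1/2$ is excluded because the station formula is undefined there (the integrand is then constant and any station serves).

```python
import math


def one_point_rule(lam):
    """Single station of unit weight chosen to integrate xi**(2*lam - 1) exactly."""
    station = (2.0 * lam) ** (1.0 / (1.0 - 2.0 * lam))   # requires lam != 0.5
    return station ** (2.0 * lam - 1.0)


def gauss_two_point(lam):
    """Standard 2-point Gauss-Legendre rule mapped to the interval [0, 1]."""
    half = 0.5 / math.sqrt(3.0)
    return 0.5 * sum((0.5 + s * half) ** (2.0 * lam - 1.0) for s in (-1.0, 1.0))


for lam in (0.1752, 0.6619):
    exact = 1.0 / (2.0 * lam)
    print(f"lambda = {lam}: exact = {exact:.4f}, "
          f"one-point = {one_point_rule(lam):.4f}, "
          f"Gauss = {gauss_two_point(lam):.4f}")
```

For the two exponents quoted in the examples, the one-point rule recovers $1/2\lambda$ to rounding error, whereas the Gauss rule underestimates the integral by roughly thirty percent for the smaller exponent, where the integrand is unbounded at $\xi = 0$.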

EXAMPLES 

The examples are problems of elastic fracture mechanics. The finite element approach employed was that based upon the principle of minimum potential energy, so that $\phi$ of the last section now stands for the displacement vector function. The first problem is the bimaterial elastic strip with a pressurized crack normal to and terminating at the bondline. The geometry is illustrated in Fig. 2. The material on the left is cracked and designated as material 1 with shear modulus $\mu_1$ and Poisson's ratio $\nu_1$; material 2 to the right has properties $\mu_2$, $\nu_2$. The crack length, plate width, and height are related by a/b = a/h = 1/9. The left end of the crack, being surrounded completely by one material, is a singular point with displacement varying as $r^{1/2}$. The bondline crack tip has a singularity dependent upon the bimaterial elastic properties. Displacement varies as $r^\lambda$ with $\lambda$ a function of the material constants and also the type of planar deformation, i.e. plane stress vs. plane strain. The examples here are plane strain and the material combination is aluminum-epoxy. For aluminum $\mu = 3.846 \times 10^6$ psi, $\nu = 0.3$; and for epoxy $\mu = 0.1667 \times 10^6$ psi, $\nu = 0.35$. With aluminum as the cracked material $m = \mu_2/\mu_1 = 0.043$ and $\lambda = 0.1752$. When epoxy is the cracked material m = 23.08 and $\lambda = 0.6619$.

Figure 3 shows the mesh used in the crack location. Symmetry allowed modeling just the upper half of the strip. Isosceles triangles with a radial dimension of a/100 and angular extent of 15° were used as singularity elements about each crack tip. Of course, about each tip the appropriate value of $\lambda$ was used to generate the element stiffnesses. The radial dimension of the singularity elements is a crucial aspect of the finite element model. The singularity elements should be entirely within the region where displacement is accurately represented by the $r^\lambda$ form. The crack opening displacement data from available singular integral equation solutions were used to establish the suitability of the radial dimension a/100. When there is no basis for judgment of the range of dominance of the leading power term in the full solution, a convergence study must be conducted by successively decreasing element size to establish accuracy estimates of the singularity solution.

Bilinear  isoparametric  elements  were  used  to  model  the  plate 
away  from  the  singularities.  The  total  mesh  involved  429  nodes  and  433 
elements.  The  forces  specified  to  be  acting  on  the  crack  face  nodes  were 
calculated,  in  terms  of  the  uniform  pressure  p,  consistent  with  the  element 
shape functions. Thus, the singularity element node on the crack face had an applied normal force per unit thickness equal to $.01\,pa/(1 + \lambda)$.
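The quoted nodal force is what a consistent load calculation gives when the crack face edge of a singularity element has length a/100 and the shape function of its outer node varies as $(r/h)^\lambda$ along that edge, with h the edge length. The sketch below states this as an assumption and cross-checks the closed form $ph/(1+\lambda)$ against a simple midpoint quadrature; the function names and sample values are hypothetical.

```python
def face_node_force(p, a, lam, h_over_a=0.01):
    """Closed form of p * integral_0^h (r/h)**lam dr for the outer crack-face node."""
    return p * h_over_a * a / (1.0 + lam)


def midpoint_check(p, a, lam, h_over_a=0.01, n=20000):
    """Midpoint-rule evaluation of the same integral, used only as a cross-check."""
    h = h_over_a * a
    dr = h / n
    return p * sum(((k + 0.5) * dr / h) ** lam for k in range(n)) * dr


p, a = 1.0, 1.0
for lam in (0.1752, 0.5, 0.6619):
    print(lam, face_node_force(p, a, lam), midpoint_check(p, a, lam))
```

The remainder of the edge load, $ph\lambda/(1+\lambda)$ under the same assumption, would fall on the crack tip node, so that the two nodal forces sum to the total load ph on the edge.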

Three features of the solutions to be discussed are the angular distribution of stress about the bondline crack tip, the crack opening behavior near the bondline, and the stress intensity factors. The angular variation of the normalized stress $\sigma_{yy}/p$ through the ring of bond tip singular elements is given in Figure 4. Data are given for both $\mu_2/\mu_1$ combinations. Along with the finite element data at the twelve discrete midpoint angles, singular integral equation (SIE) data are given at angles of 0, 90 and 180° and r = 0.005a. The first striking characteristic of the distribution is the discontinuity of stress across the bondline. Independent of which material is cracked, at 90°, the bondline, the aluminum is stressed higher than the epoxy. Hence, when the epoxy is cracked $\sigma_{yy}(90^-)$ exceeds $\sigma_{yy}(90^+)$, and just the opposite when the aluminum is cracked. There is very good agreement between the SIE and finite element solutions with the exception of the 90° values for m = 0.043. The finite element mesh is perhaps too coarse in the angular sense to accommodate the large gradient in the range 90-180° for m = 0.043, so that mesh refinement might improve this deviation.

In Figure 5 the normalized crack opening displacement $u_y/a$ is plotted as a function of distance from the bondline crack tip to r/a = 0.16 for the two cases. The data correspond to a unit value of crack face pressure. The finite element data appear in discrete fashion in the plot and for comparison purposes the SIE solutions are presented and are represented by the solid curves. There is excellent agreement between the solutions for m = 23.08, and this is true over the entire crack face, 0 < r/a < 2. While the SIE and finite element data agree at r/a = 0.01 for m = 0.043, the solutions differ by 5-10% over most of the crack face, including near the embedded end. There is a dramatic difference in the opening behavior local to the bondline for the two cracked cases. The SIE curves demonstrate the behavior which is expected from the $r^{0.175}$ and $r^{0.662}$ asymptotic displacement solutions. With epoxy bonded to cracked aluminum there is a rapid gradient in opening which is intuitively consistent with the stiffness mismatch.

The intersection of the two curves is near r/a = 0.01, the location of the first finite element node, and the opening displacements $u_y/a$ there are
0.193x10"^  for  m  =  23.  08,  and  0.222  x  10  ^  for  m=  0.043. 

The stress intensity factor, generalized for both the embedded and bondline crack tips, is defined as

$$K = \lim_{r \to 0} \sqrt{2}\,r^{1-\lambda}\,\sigma_{yy}(r,0) \qquad (9)$$

To deduce K from the displacement data, the following equation was used

$$K = 2\sqrt{2}\,\lambda\mu^*\,u_y(r,\pi)/r^\lambda \qquad (10)$$

The modulus $\mu^*$ is defined from the relationship

$$u_y(r,\pi) = r\,\sigma_{yy}(r,0)/2\lambda\mu^* \qquad (11)$$

$\mu^*$ is an algebraic function of the bimaterial constants and the eigenvalue $\lambda$. For the plane strain homogeneous material case, m = 1, $\mu^*$ is equal to $\mu/2(1-\nu)$.

From eqn. (10), the stress intensity factor at the embedded tip,
$K/p\sqrt{a}$, computed from the finite element data at r/a = 0.01, was
found to equal 0.89 when m = 23.08, and 1.52 when m = 0.043. For a
homogeneous plate the result is 1.00; this shows the degree to which the
aluminum reduces the severity of the singularity in the cracked epoxy, and
how much more severe the singularity is in aluminum when epoxy is bonded
to it. The values for K/pa at the bondline crack ends are 2.85 for
m = 23.08, and 0.112 for m = 0.043. The SIE displacement data predict
essentially the same K values, with the exception of the embedded tip
value for m = 0.043, which is 10% lower, consistent with the displacement
deviation mentioned above. A detailed discussion of the results of the
bimaterial crack problem will be reserved for a future specialized paper
(reference 4).

The second example is the branch crack in an elastic tension strip,
Figure 6a. The main crack emanates from the free edge at 45° and its
projected length normal to the tension is W/4. W is the strip width,
and 3W is the strip length. The branch normal to the tension has length
W/80. There are two singularities in this problem, each with local
$r^{\lambda}$ displacement distributions. The right end of the branch has
the usual crack tip singularity with $\lambda$ = 1/2, while the angle on
the upper face of the crack is a reentrant corner with $\lambda$ = 0.674.
These conclusions are drawn from the asymptotic analysis of reference 5.
The finite element mesh at the branch is shown in Figure 6b. The
singularity elements were chosen to have a radial extent 5% of the branch
length and an angular dimension of 22.5°.

The angular variation of the normalized polar stress
$\sigma_{\theta\theta}/\sigma_{\infty}$ about the bend singularity is given
in Figure 7. The data are from the singularity element midpoints. The
stress state is essentially entirely compressive, with peak compression
equal to 3.1 at $\theta$ = 125°. This suggests that forking would not
occur from this point. At the right end of the branch the stress intensity
factors were deduced from the singularity element crack face nodal
displacements. If $\delta$ represents the relative opening displacement of
the nodes on the two crack faces and $\Delta$ the relative sliding
displacement, the equations used to determine $K_I$ and $K_{II}$ for this
plane stress example were


Notice that the factor $\sqrt{\pi}$ is not used in these definitions. The
value of $K_I$ was found to be 4% lower than the value for an unbranched
crack normal to the tension with length (1.05) W/4,

$$K_I = 1.49\, \sigma_{\infty} \sqrt{(1.05)\, W/4}$$

$K_{II}$ was determined to be negligible in relation to $K_I$:
$K_{II}/K_I < 10^{-2}$. An additional problem was considered which had the
above geometry altered by extending the branch length to W/40. $K_I$ again
was 4% lower than that of the projected length crack,

$$K_I = 1.52\, \sigma_{\infty} \sqrt{(1.10)\, W/4}$$

and $K_{II}/K_I < 10^{-4}$.

CONCLUSIONS


The  solutions  to  the  crack  problems  are  judged  to  be  very  accurate. 
The  agreement  between  the  singular  integral  equation  and  finite  element 
results  for  the  bimaterial  problems  supports  this  conclusion.  Certainly 
no  standard  finite  element  formulation  can  be  expected  to  provide  reasonable 
solutions  to  problems  such  as  these.  The  formulation  proposed  here 
allows  routine  analysis  of  a  class  of  singularity  problems  which  heretofore 
has  been  approached  only  with  elaborate  analytical  methods.  The  singular 
element  proposed  is  simple  to  implement  since  it  is  easily  programmed 
using techniques which today are commonplace.


Mesh in Crack Location, Bimaterial Problem


FIGURE 4

Bondline Cracktip Angular Stress Variation


FIGURE 5

Crack Opening Displacement vs. Distance From Aluminum-Epoxy Bondline


FIGURE 6a

Strip With Branch Crack


FIGURE 6b

Mesh in Location of Branch


FIGURE 7

Polar Stress Variation About Bend Singularity, Branch Crack Problem


CRACK TIP FIELDS IN STEADY CRACK GROWTH WITH LINEAR STRAIN HARDENING


John  C.  Amazigo 

Department  of  Mathematical  Sciences 
Rensselaer  Polytechnic  Institute,  Troy,  New  York  12181 

and 

John  W.  Hutchinson 

Division  of  Engineering  and  Applied  Physics 
Harvard  University,  Cambridge,  Massachusetts  02138 


SUMMARY 

Singular  stress  and  strain  fields  are  found  at  the  tip  of  a  crack 
growing  steadily  and  quasi-statically  into  an  elastic-plastic  strain  hardening 
material.  The  material  is  characterized  by  J2  flow  theory  together  with  a 
bilinear  effective  stress-strain  curve.  Anti-plane  shear,  plane  stress  and 
plane  strain  are  each  considered.  Numerical  results  are  given  for  the  order 
of  the  singularity,  details  of  the  stress  and  strain-rate  fields,  and  the 
near-tip  regions  of  plastic  loading  and  elastic  unloading. 


This paper is to be published in the Journal of Mechanics and Physics
of Solids.


FINITE-DIFFERENCE SOLUTION OF POISSON'S EQUATION
IN RECTANGLES OF ARBITRARY PROPORTIONS

J. Barkley Rosser

Mathematics Research Center, University of Wisconsin,
Madison, Wisconsin

1. Introduction.

We consider the problem of getting an approximation of reasonably
good accuracy by finite-difference methods for the function u(x, y)
which satisfies Poisson's equation

(1.1)  $\nabla^2 u(x, y) = f(x, y)$

inside a rectangle R, and satisfies various boundary conditions on
the boundary of R. When f(x, y) = 0, (1.1) reduces to Laplace's
equation, and the problem is appreciably simpler.

This problem has been much studied. A common approach is to
cover R exactly with a mesh or grid of small rectangles, after which
one can replace (1.1) by a finite-difference approximation involving
values of u(x, y) at the grid points. One then tries to solve this
finite-difference analogue of (1.1) to a suitable degree of accuracy. In
order to employ this approach when high accuracy is required, it has been
necessary to require that the ratio of the sides of R be rational,
since use of high order methods usually requires that one cover R
exactly with a grid of squares. However, the conformal transformation
method of Papamichael and Whiteman [2] will lead more often than not
to a rectangle in which the ratio is not rational, and covering with a grid
of squares is not possible. Even when the ratio is rational, there may
be difficulties. Suppose, from some engineering problem, one is confronted
with a rectangle R of base six and five-eighths and height five and
seven-eighths. If this is to be covered exactly with squares, there must
be 53N squares along the base and 47N squares along a vertical side,
where N is a positive integer. With such a covering, many popular
methods would operate at less than maximum efficiency.

Accordingly, we will propose a method of getting good accuracy
with moderate labor for rectangles of arbitrary proportions.

The author wishes to acknowledge the sponsorship of the United
States Army under Contract No. DAAG29-75-C-0024 and of the Science
Research Council under grant B/RG 4121 at Brunel University.
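The count of 53N by 47N squares quoted in the Introduction can be checked
mechanically. The following Python sketch is an illustration only; the
helper name `square_counts` is our own and does not come from the paper.
It reduces the ratio of the sides to lowest terms, which gives the
smallest numbers of equal squares that fit along the base and the height.

```python
from fractions import Fraction


def square_counts(base, height):
    """Smallest numbers of equal squares along the base and along the height.

    Both arguments must be exact rationals (Fraction); an irrational ratio
    admits no exact covering by squares at all.
    """
    ratio = Fraction(base) / Fraction(height)  # stored in lowest terms
    return ratio.numerator, ratio.denominator


# Base six and five-eighths, height five and seven-eighths.
base = Fraction(53, 8)
height = Fraction(47, 8)
print(square_counts(base, height))  # (53, 47)
```

Any other exact covering uses an integer multiple N of these two counts.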


2.  Formulation  of  the  problem. 

By rotation, translation, and scaling, as needed, we can take
the rectangle R to be that shown in Figure 1. By rotating through
another 90° and translating and scaling again, if need be, we can
assure that $a \ge \pi$. If $a = \pi$, we have a square, and familiar
approaches suffice. So we assume $a > \pi$.

The rectangle R
Figure 1

We consider first the case of Dirichlet boundary conditions. That
is, we wish to approximate the function u(x, y) which is continuous
on and inside R, satisfies

(2.1)  $\nabla^2 u(x, y) = f(x, y)$

inside R, and on the sides of R satisfies the Dirichlet boundary
conditions

(2.2)  $u(0, y) = g_0(y)$,      $0 \le y \le a$
(2.3)  $u(\pi, y) = g_\pi(y)$,  $0 \le y \le a$
(2.4)  $u(x, 0) = h_0(x)$,      $0 \le x \le \pi$
(2.5)  $u(x, a) = h_a(x)$,      $0 \le x \le \pi$.

Because we seek a u(x, y) which is continuous on R, as well
as inside, we are thereby assuming that $g_0(y)$ and $g_\pi(y)$ are
continuous for $0 \le y \le a$, that $h_0(x)$ and $h_a(x)$ are continuous
for $0 \le x \le \pi$, and that

(2.6)  $g_0(0) = h_0(0)$,
(2.7)  $g_0(a) = h_a(0)$,
(2.8)  $g_\pi(0) = h_0(\pi)$,
(2.9)  $g_\pi(a) = h_a(\pi)$.

If there should be discontinuities in the boundary conditions, or
their derivatives, this would induce still another source of errors in
the solutions, besides those due to truncation and round off. See
Rosser [3]. "Jump" discontinuities can be "removed" by the methods
on pp. 221-222 of Milne [4]. More complicated discontinuities can
sometimes be "removed", but one cannot count on doing this. For the
present treatment, we assume that the boundary conditions and their
low order derivatives are continuous. This includes continuity at the
corners, as exemplified by (2.6) through (2.9). Or, if we replace (2.2)
by

$u_x(0, y) = j_0(y)$,  $0 \le y \le a$,

then continuity of the first derivatives at the corners would require

$j_0(0) = h_0'(0)$,
$j_0(a) = h_a'(0)$.

3.  Finite-difference  approximations. 

There are finite-difference approximations of various orders. The
higher order methods of solution, involving the higher order approximations,
can be used effectively only when the function f(x, y) which appears
in (2.1) has suitable high order smoothness; that is, when it is continuous
and has continuous derivatives of suitable orders. Thus the reader must
exercise discrimination in choosing which order method to use. When
they can be used, the high order methods permit the use of coarse
meshes. This can greatly reduce the labor of computation.

For difference approximations of order 2, one can use mesh elements
which are rectangles, rather than squares. See Hockney [1]. In this
case, there would be no trouble if the ratio of the sides of R were
irrational. For difference approximations of order 4, one can also use
mesh elements which are rectangles. See Rosser [5]. For difference
approximations of order 6, it appears that the mesh elements have to be
squares. Details are presented in Rosser [5]. If f(x, y) in (2.1) is
sufficiently smooth, this permits one to use quite a coarse mesh, greatly
reducing the computational labor. However, this raises the question of
how to proceed if the ratio of the sides of R is irrational.

4.  Ill-proportioned  rectangles. 

We take h to be the side of the square mesh element. We
arrange that the squares can be fitted along the base of R. That is,
we take M to be a positive integer, and define

(4.1)  $h = \pi / M$.

We take N to be the integer part of $aM/\pi$; in symbols

(4.2)  $N = \lfloor aM/\pi \rfloor$.

Then

(4.3)  $Nh \le a$,
(4.4)  $(N + 1) h > a$.

If

(4.5)  $Nh = a$,

then we can fill up the rectangle R exactly with MN squares of
side h, and the methods of Rosser [5] are applicable. So we are
interested only in the case Nh < a. We could assume this, but it is
not required for the analysis which follows. If we should have (4.5)
holding, then some of the steps of the subsequent analysis would be
quite trivial but not incorrect in any way.

We begin by defining

(4.6)  $b = Nh$,
(4.7)  $c = a - b = a - Nh$.
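The bookkeeping of (4.1) through (4.7) can be sketched in a few lines of
Python. This is a minimal illustration under the assumption of ordinary
floating-point arithmetic; the helper name `mesh_parameters` is
hypothetical and not part of the paper.

```python
import math


def mesh_parameters(a, M):
    """Return h, N, b, c of (4.1)-(4.7) for a rectangle of base pi, height a.

    Note: when a/h is an integer, rounding may put the computed quotient
    just below that integer; such borderline inputs need separate care.
    """
    if M < 1 or a <= 0.0:
        raise ValueError("need a positive integer M and a positive height a")
    h = math.pi / M          # (4.1)
    N = math.floor(a / h)    # (4.2), the integer part of a*M/pi
    b = N * h                # (4.6)
    c = a - b                # (4.7), so that 0 <= c < h
    return {"h": h, "N": N, "b": b, "c": c}


params = mesh_parameters(a=4.0, M=8)
print(params["N"], params["c"] < params["h"])
```

For a = 4 and M = 8 the strip left over at the top has height c of
roughly 0.07, well below the mesh size h of roughly 0.39.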

We take $R_b$ to be the rectangle with corners (0, 0), (0, b), ($\pi$, 0),
and ($\pi$, b), and take $R_c$ to be the rectangle with corners (0, c),
(0, a), ($\pi$, c), and ($\pi$, a).

We choose $h_b(x)$ to be a smooth function such that

$h_b(0) = g_0(b)$,  $h_b(\pi) = g_\pi(b)$.

The better we can choose $h_b(x)$ to approximate u(x, b), the more we
can curtail certain computations later. With the limited information
available at this stage, we content ourselves with taking

$h_b(x) = h_a(x) + (1 - x/\pi)(g_0(b) - h_a(0)) + (x/\pi)(g_\pi(b) - h_a(\pi))$.
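As a small illustration of this choice of $h_b(x)$, the following Python
sketch builds it from given boundary functions and lets one confirm the
two corner conditions. The function name `make_h_b` and its calling
convention are assumptions made for the example only.

```python
import math


def make_h_b(h_a, g_0, g_pi, b):
    """Return h_b(x) = h_a(x) + (1 - x/pi) * d0 + (x/pi) * dpi.

    d0 and dpi are the corner mismatches g_0(b) - h_a(0) and
    g_pi(b) - h_a(pi), so that h_b(0) = g_0(b) and h_b(pi) = g_pi(b).
    """
    d0 = g_0(b) - h_a(0.0)
    dpi = g_pi(b) - h_a(math.pi)

    def h_b(x):
        t = x / math.pi
        return h_a(x) + (1.0 - t) * d0 + t * dpi

    return h_b


# Illustrative boundary data only; any smooth functions would do here.
h_b = make_h_b(h_a=math.cos, g_0=lambda y: 1.0 + 0.1 * y,
               g_pi=lambda y: -1.0 + 0.2 * y, b=3.9)
print(h_b(0.0), h_b(math.pi))
```

The linear blend only repairs the corner values; it makes no attempt to
follow the interior behaviour of u(x, b).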

We take $u_b(x, y)$ to be the function which is continuous on and
inside $R_b$, satisfies (2.1) inside $R_b$, and on the sides of $R_b$
satisfies the boundary conditions

(4.8)   $u_b(0, y) = g_0(y)$,      $0 \le y \le b$
(4.9)   $u_b(\pi, y) = g_\pi(y)$,  $0 \le y \le b$
(4.10)  $u_b(x, 0) = h_0(x)$,      $0 \le x \le \pi$
(4.11)  $u_b(x, b) = h_b(x)$,      $0 \le x \le \pi$.

We take $u_c(x, y)$ to be the function which is continuous on and inside
$R_c$, satisfies (2.1) inside $R_c$, and on the sides of $R_c$ satisfies the
boundary conditions

(4.12)  $u_c(0, y) = g_0(y)$,      $c \le y \le a$
(4.13)  $u_c(\pi, y) = g_\pi(y)$,  $c \le y \le a$
(4.14)  $u_c(x, c) = u_b(x, c)$,   $0 \le x \le \pi$
(4.15)  $u_c(x, a) = h_a(x)$,      $0 \le x \le \pi$.

By our definition of $h_b(x)$, we see that $u_b(x, y)$ has continuous
boundary conditions around the rectangle $R_b$. Then it follows by (4.14)
that the same holds for $u_c(x, y)$ relative to the rectangle $R_c$. This
is why in (4.8) through (4.15) we can use $\le$ rather than $<$.

By (4.1) and (4.6) we can fill up the rectangle $R_b$ exactly with
MN squares of side h. Thus we can use the 9-point difference
approximation of Rosser [5] to get accurate approximations for $u_b(x, y)$
inside $R_b$ at the grid points (mh, nh). From these, we can get
accurate approximations for $u_b(mh, c)$. By (4.14) these are part of
the boundary values for $u_c(x, y)$. Thus it is necessary to determine
them to order $h^6$. By the principle of the maximum, it is also sufficient.
For a given m, the point (mh, c) is on a vertical grid line. Thus
one can determine $u_b(mh, c)$ to order $h^6$ by using a high order
interpolation formula in one dimension on the values at the six grid
points (mh, 0), (mh, h), (mh, 2h), (mh, 3h), (mh, 4h), and (mh, 5h).

By (4.14), this gives us good approximations to $u_c(x, c)$ at
x = h, 2h, ..., (M - 1)h. By (4.1) and (4.7) we can fill up the rectangle
$R_c$ with MN squares of side h. Thus we can use the 9-point
difference approximation of Rosser [5] to get accurate approximations
for $u_c(x, y)$ inside $R_c$ at the grid points (mh, c + nh). Then we
can get accurate approximations for $u_c(mh, b)$ by the method mentioned
earlier.
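The one dimensional interpolation step can be sketched as follows. The
paper does not say which high order formula is intended; this example
assumes plain Lagrange interpolation through the six grid values, and
the name `interpolate_six_point` is our own.

```python
import math


def interpolate_six_point(values, h, y):
    """Lagrange interpolation through (k*h, values[k]), k = 0..5, at height y.

    For smooth data the error is of order h**6, which is the accuracy
    asked of the boundary values u_b(mh, c).
    """
    if len(values) != 6:
        raise ValueError("exactly six grid values are required")
    nodes = [k * h for k in range(6)]
    total = 0.0
    for i, (y_i, v_i) in enumerate(zip(nodes, values)):
        weight = 1.0
        for j, y_j in enumerate(nodes):
            if j != i:
                weight *= (y - y_j) / (y_i - y_j)
        total += weight * v_i
    return total


h = 0.1
c = 0.037  # a point between the first two grid rows, as c lies in [0, h)
column = [math.sinh(k * h) for k in range(6)]
print(interpolate_six_point(column, h, c), math.sinh(c))
```

Since $0 \le c < h$, the point of interpolation lies at the end of the
six-point stencil rather than at its centre, which is why a formula of
this order is needed.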

We define $R_{bc}$ to be the rectangle which is the intersection
of the rectangles $R_b$ and $R_c$. In $R_{bc}$ the function
$u_c(x, y) - u_b(x, y)$ is harmonic. Also, it is zero along the bottom and
along the two vertical sides. So on and inside $R_{bc}$ we have

(4.16)  $u_c(x, y) - u_b(x, y) = \sum_{r=1}^{\infty} a_r \frac{\sinh r(y - c)}{\sinh r(b - c)} \sin rx$,

where

(4.17)  $a_r = \frac{2}{\pi} \int_0^{\pi} \{u_c(x, b) - u_b(x, b)\} \sin rx \, dx$.

Clearly the $|a_r|$ are bounded by

(4.18)  $2 \max_{0 \le x \le \pi} |u_c(x, b) - u_b(x, b)|$.

We recall (see (4.11)) that

$u_b(x, b) = h_b(x)$.

Presumably $u_c(x, b)$ is fairly close to u(x, b). If also we were lucky
enough to choose $h_b(x)$ fairly close to u(x, b), then by (4.18) the
$a_r$ will be fairly small. This will save computational effort later.

On and inside R define

(4.19)  $v(x, y) = \sum_{r=1}^{\infty} a_r b_r \frac{\sinh r(a - y)}{\sinh ra} \sin rx$,

where

(4.20)  $b_r = \frac{\sinh rc}{\sinh r(b - c)}$.

On and inside $R_b$, define

(4.21)  $u(x, y) = u_b(x, y) + v(x, y) + \sum_{r=1}^{\infty} a_r \frac{\sinh r(y - c)}{\sinh r(b - c)} \sin rx$.

We see that u(x, y) is continuous on and inside the rectangle
$R_b$, satisfies (2.1) inside $R_b$, and on three sides satisfies the
boundary conditions (4.8), (4.9), and (4.10). By (4.16), we see that on
and inside $R_{bc}$ we have

(4.22)  $u(x, y) = u_c(x, y) + v(x, y)$.

We use (4.22) to define u(x, y) for the rest of the rectangle $R_c$.
Then u(x, y) is continuous on and inside the rectangle $R_c$, satisfies
(2.1) inside $R_c$, and on three sides satisfies the boundary conditions
(4.12), (4.13), and (4.15).

Thus we see that the function u(x, y) so defined is exactly the function
that we were seeking to obtain.

We have obtained accurate approximations for $u_b(x, y)$ and
$u_c(x, y)$ at various grid points. If M is of reasonable size, then c
is small, since $0 \le c < h$ by (4.7), (4.3), and (4.4). As a is
greater than $\pi$, and b = a - c by (4.7), we see that the series on
the right of (4.19) is rapidly convergent for $0 \le y \le a$. Also, the
series appearing on the right of (4.21) is rapidly convergent for small y,
certainly for $0 \le y \le h$. If in addition the $a_r$ are all quite small
(see (4.18)), then very few terms of the series are needed to get high
accuracy. So, using the known approximations for $u_b(mh, nh)$, we can get
approximate values for u(x, y) for small y by (4.21). For all other
values of y, we can use the known approximations for $u_c(mh, c + nh)$
to get approximate values for u(x, y) by (4.22).
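To make the use of (4.19) through (4.22) concrete, here is a short Python
sketch that evaluates the truncated series and combines them with grid
values of $u_b$ or $u_c$. All function names are hypothetical, and the
coefficients $a_r$ are taken as given inputs.

```python
import math


def b_coeff(r, b, c):
    """b_r of (4.20)."""
    return math.sinh(r * c) / math.sinh(r * (b - c))


def v(x, y, a_coeffs, a, b, c):
    """Truncated series (4.19); a_coeffs[0] is a_1, a_coeffs[1] is a_2, ..."""
    return sum(
        a_r * b_coeff(r, b, c) * math.sinh(r * (a - y)) / math.sinh(r * a)
        * math.sin(r * x)
        for r, a_r in enumerate(a_coeffs, start=1)
    )


def overlap_series(x, y, a_coeffs, b, c):
    """Truncated series on the right of (4.16), also used in (4.21)."""
    return sum(
        a_r * math.sinh(r * (y - c)) / math.sinh(r * (b - c)) * math.sin(r * x)
        for r, a_r in enumerate(a_coeffs, start=1)
    )


def u_from_u_b(u_b_value, x, y, a_coeffs, a, b, c):
    """(4.21): intended for small y, where its series converges rapidly."""
    return u_b_value + v(x, y, a_coeffs, a, b, c) + overlap_series(x, y, a_coeffs, b, c)


def u_from_u_c(u_c_value, x, y, a_coeffs, a, b, c):
    """(4.22): used for all other values of y."""
    return u_c_value + v(x, y, a_coeffs, a, b, c)
```

With this arrangement the two corrections cancel on the bottom of R and
v vanishes on the top, so the prescribed Dirichlet values there are left
untouched.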
The calculation of the $a_r$ presents no problem. Not more than

four or five will be required; fewer if the $a_r$ are all small. Observe
that the values of $u_b(x, b)$ are given by (4.11). Also, we had got
accurate approximations for $u_c(mh, b)$. So we can use a numerical
quadrature formula to calculate the $a_r$ by (4.17).

CAUTION. If r is not fairly small compared to N, then there
will be fairly few abscissa points in each cycle of sin rx in (4.17);
in such case the usual quadrature formulas are not trustworthy. One
can get twice, or four times, or eight times, as many abscissa points
by interpolating to get approximations for $u_c(x, b)$ at the additional
abscissa points (recall that $u_b(x, b)$ is given by (4.11)). For this
interpolation one can use a high order one dimensional interpolation
formula on the values $u_c(0, b)$, $u_c(h, b)$, $u_c(2h, b)$, ... .

We need high accuracy for only the first one or two of the $a_r$,
because of the very rapid convergence of the series appearing on the
right of (4.19) and (4.21). In any case, one should increase the number
of abscissa points, as needed, to the point where one can use a
quadrature formula with assurance. Also, by a little foresight in the
choice of M, one can arrange that, after increasing the number of
abscissa points if needed, one can use a high order quadrature formula,
like Bode's Rule, for example.
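As an illustration of the quadrature step, the sketch below applies the
composite form of Bode's (Boole's) five-point rule to (4.17). It assumes
that M is divisible by 4, which is one instance of the "foresight in the
choice of M" mentioned above; the function names are our own.

```python
import math


def boole_integral(samples, h):
    """Composite Boole's rule for equally spaced samples with spacing h.

    The number of intervals, len(samples) - 1, must be a multiple of 4.
    """
    n = len(samples) - 1
    if n < 4 or n % 4 != 0:
        raise ValueError("number of intervals must be a positive multiple of 4")
    total = 0.0
    for k in range(0, n, 4):
        f0, f1, f2, f3, f4 = samples[k:k + 5]
        total += 7 * f0 + 32 * f1 + 12 * f2 + 32 * f3 + 7 * f4
    return 2.0 * h / 45.0 * total


def sine_coefficient(diff_samples, r):
    """a_r of (4.17) from samples of u_c(x, b) - u_b(x, b) at x = m*h.

    diff_samples[m] is the difference at x = m*h, m = 0..M, with h = pi/M.
    """
    M = len(diff_samples) - 1
    h = math.pi / M
    integrand = [d * math.sin(r * m * h) for m, d in enumerate(diff_samples)]
    return 2.0 / math.pi * boole_integral(integrand, h)


M = 16
h = math.pi / M
diff = [0.02 * math.sin(m * h) - 0.005 * math.sin(3 * m * h) for m in range(M + 1)]
print([sine_coefficient(diff, r) for r in (1, 2, 3)])
```

In line with the CAUTION above, the result should be trusted only for
values of r small enough that each cycle of sin rx contains a good number
of abscissa points.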
5. Tests for accuracy.


One advantage of using the 9-point difference approximation when
one can exactly fill up the rectangle with squares is that one can make
a first calculation, for less than a quarter of the calculating effort,
with squares twice as large on a side, and then repeat with the smaller
squares. Because the error is of the order of $h^6$, one can get an
estimate of the error.
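One way to turn the two calculations into an error estimate is sketched
below. It assumes, beyond what is stated above, that at a grid point
common to both meshes the error behaves like a single term $C h^6$ with
the same constant C for both mesh sizes; the function name is
hypothetical.

```python
def richardson_error_estimate(u_coarse, u_fine, order=6):
    """Estimate the error (u_fine - exact) from results on meshes 2h and h.

    If u_coarse = exact + C*(2h)**order and u_fine = exact + C*h**order,
    then u_coarse - u_fine = (2**order - 1) * C * h**order.
    """
    return (u_coarse - u_fine) / (2 ** order - 1)


# Synthetic illustration with a known leading error term.
exact, C, h = 1.0, 0.3, 0.1
u_fine = exact + C * h ** 6
u_coarse = exact + C * (2 * h) ** 6
print(richardson_error_estimate(u_coarse, u_fine))
```

For order 6 the divisor is 63, so the difference between the two
calculations overstates the error of the finer one by about that factor.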

This  can  be  done  with  the  present  procedure  by  choosing  M 
divisible  by  2.  If  N  is  not  divisible  by  2,  the  values  of  b  and  c 
which  are  used  with  the  squares  of  side  2h  will  not  be  the  same  as 
those  which  are  used  with  the  squares  of  side  h.  However,  this 
does  not  matter. 

One dividend that will accrue from making an initial calculation
with squares of side 2h is that from this calculation one can derive
a very good approximation to take for $h_b(x)$. Then, for the calculation
with squares of side h, the $a_r$ will be very small, so that not more
than two or three of them will be needed.

6.  Neumann  boundary  conditions. 

Suppose we have the same rectangle R, and impose on u(x, y)
the same conditions as before, except that on top of the rectangle R
we specify values to be taken by $u_y(x, a)$. That is, we replace (2.5)
by the Neumann condition

(6.1)  $u_y(x, a) = k_a(x)$,  $0 \le x \le \pi$.

We postpone to the latter part of the section a discussion of how
one would handle this in the case in which $a/\pi$ is rational, so that
one can fill up R exactly with squares of side h. For the moment,
let us assume that this can be done, and explain how to generalize to
the case in which $a/\pi$ is irrational.

We proceed very nearly as in Section 4. Instead of the definition
given there of $h_b(x)$, we use

(6.2)  $h_b(x) = (1 - x/\pi)\, g_0(b) + (x/\pi)\, g_\pi(b)$.

We take $u_b(x, y)$ as before, but for $u_c(x, y)$ we replace (4.15) by
the analogue of (6.1), namely

(6.3)  $\frac{\partial}{\partial y} u_c(x, a) = k_a(x)$,  $0 \le x \le \pi$.

Everything now goes the same, down to the definition of v(x, y). Let
us pause a moment, and think what we require of v(x, y). Clearly it
should be harmonic, so that u(x, y), as defined in part by (4.21) and
in part by (4.22), will satisfy (2.1) inside R. Also, we wish v(x, y)
to be zero on the vertical sides of R, so that there u(x, y) will
satisfy the proper boundary conditions. Also, on the bottom of R, we
must have

(6.4)  $v(x, 0) = \sum_{r=1}^{\infty} a_r \frac{\sinh rc}{\sinh r(b - c)} \sin rx$,  $0 \le x \le \pi$,

so that by (4.21) u(x, y) will satisfy the right boundary conditions
on the bottom of R. Finally, looking at (4.22), we see that if u(x, y)
is to satisfy the right boundary conditions on the top of R, we must have

(6.5)  $\frac{\partial}{\partial y} v(x, a) = 0$,  $0 \le x \le \pi$.

All these conditions can be met by simply replacing the factor

$\frac{\sinh r(a - y)}{\sinh ra}$

in the definition of v(x, y) by

$\frac{\cosh r(a - y)}{\cosh ra}$.

In this case, since it is unlikely that (6.2) makes $h_b(x)$ come
out very close to u(x, b), we cannot count on the $a_r$ being
particularly small, so that two or three more of them might have to be
calculated. It might be better to turn the rectangle R upside down
and proceed as follows.

Consider next the case in which the Neumann condition is at the
bottom of R. That is, u(x, y) satisfies (2.2), (2.3), and (2.5),
but (2.4) is replaced by

(6.6)  $u_y(x, 0) = k_0(x)$,  $0 \le x \le \pi$.

Again, we proceed nearly as in Section 4. We can now take
$h_b(x)$ the same as in Section 4, which should lead to smaller values
of the $a_r$, so that we can get by with calculating fewer of them. For
the definition of $u_b(x, y)$, we replace (4.10) by the analogue of
(6.6), namely

(6.7)  $\frac{\partial}{\partial y} u_b(x, 0) = k_0(x)$,  $0 \le x \le \pi$.

We take $u_c(x, y)$ as in Section 4, and continue the same down to the
definition of v(x, y). A key requirement is that u(x, y), as defined
by (4.21), shall satisfy the proper boundary conditions at the bottom of
R. In Section 4, this required that

(6.8)  $v(x, y) + \sum_{r=1}^{\infty} a_r \frac{\sinh r(y - c)}{\sinh r(b - c)} \sin rx$

should be zero when y = 0. This was accomplished by the proper
choice of the $b_r$. Now we must assure that the partial derivative of
(6.8) with respect to y shall be zero when y = 0. Again, this is
accomplished by the proper choice of the $b_r$. With v(x, y) as in
(4.19), differentiating the r-th term of (6.8) and setting y = 0 leaves
the factor $-b_r \cosh ra / \sinh ra + \cosh rc / \sinh r(b - c)$, which
vanishes if we now take

(6.9)  $b_r = \frac{\sinh ra \, \cosh rc}{\sinh r(b - c) \, \cosh ra}$.

All else remains the same.
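The requirement behind (6.9) is easy to check numerically. The sketch
below, with hypothetical function names, takes $b_r$ with a positive sign,
forms the y-dependence of the r-th term of (6.8) with the common factor
$a_r \sin rx$ dropped, and estimates its slope at y = 0 by a central
difference.

```python
import math


def b_r_bottom_neumann(r, a, b, c):
    """Candidate b_r for a Neumann condition on the bottom of R."""
    return (math.sinh(r * a) * math.cosh(r * c)) / (
        math.sinh(r * (b - c)) * math.cosh(r * a)
    )


def term(y, r, a, b, c, b_r):
    """y-dependence of the r-th term of (6.8), v taken as in (4.19)."""
    return (b_r * math.sinh(r * (a - y)) / math.sinh(r * a)
            + math.sinh(r * (y - c)) / math.sinh(r * (b - c)))


def slope_at_bottom(r, a, b, c, b_r, eps=1e-5):
    """Central-difference estimate of d(term)/dy at y = 0."""
    return (term(eps, r, a, b, c, b_r) - term(-eps, r, a, b, c, b_r)) / (2 * eps)


a = 4.0
h = math.pi / 8
b = 10 * h
c = a - b
for r in (1, 2, 3):
    print(r, slope_at_bottom(r, a, b, c, b_r_bottom_neumann(r, a, b, c)))
```

The printed slopes should be zero to within the accuracy of the
difference quotient, whereas reversing the sign of $b_r$ doubles the
slope instead of cancelling it.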

Next consider the case in which there are Neumann conditions
both at the top and the bottom of R. That is, u(x, y) satisfies (2.2)
and (2.3), but (2.4) is replaced by (6.6) and (2.5) is replaced by (6.1).
We proceed much as in Section 4. In the definition of $u_b(x, y)$ we
replace (4.10) by (6.7), and in the definition of $u_c(x, y)$ we replace
(4.15) by (6.3). We define $h_b(x)$ by (6.2). It is then easily verified
that we should replace

$\frac{\sinh r(a - y)}{\sinh ra}$

in the definition of v(x, y) by

$\frac{\cosh r(a - y)}{\cosh ra}$

and define

(6.10)  $b_r = \frac{\cosh ra \, \cosh rc}{\sinh r(b - c) \, \sinh ra}$.

One can of course have Neumann conditions on one or both of the
vertical sides. Let us consider first the case in which there are Neumann
conditions on both vertical sides, but Dirichlet conditions at the top and
bottom. Rotation by 90° would reduce this to the case just considered.
However, this is not desirable, since we would then lose the qualification
that the height is greater than the base. It was this that assured
the rapid convergence of the Fourier series in (4.19) and (4.21).

So we assume that (2.4) and (2.5) hold, but that (2.2) and (2.3)
are replaced by

(6.11)  $u_x(0, y) = j_0(y)$,      $0 \le y \le a$
(6.12)  $u_x(\pi, y) = j_\pi(y)$,  $0 \le y \le a$.

We proceed analogously to Section 4, except that we use cosines
instead of sines throughout. Because it is desirable to have $u_x(x, y)$
continuous around the boundary we define

(6.13)  $h_b(x) = h_a(x) + \frac{(x - \pi)^2}{2\pi}\,(h_a'(0) - j_0(b)) + \frac{x^2}{2\pi}\,(j_\pi(b) - h_a'(\pi))$.

We define $u_b(x, y)$ and $u_c(x, y)$ as in Section 4, except that
they now have Neumann conditions on their vertical sides. We replace
(4.16) and (4.17) by

(6.14)  $u_c(x, y) - u_b(x, y) = \sum_{r=0}^{\infty} a_r \frac{\sinh r(y - c)}{\sinh r(b - c)} \cos rx$,

where

(6.15)  $a_0 = \frac{1}{\pi} \int_0^{\pi} \{u_c(x, b) - u_b(x, b)\} \, dx$,

(6.16)  $a_r = \frac{2}{\pi} \int_0^{\pi} \{u_c(x, b) - u_b(x, b)\} \cos rx \, dx$.

When r = 0, we define

$\frac{\sinh r(y - c)}{\sinh r(b - c)} = \frac{y - c}{b - c}$.

Exactly analogous changes are made in (4.19) and (4.21).

If,  in  addition  to  the  Neumann  conditions  on  the  vertical  sides, 
we  replace  one  or  both  of  the  Dirichlet  conditions  on  the  top  or  bottom 
by  Neumann  conditions,  we  can  modify  the  procedure  just  outlined 
quite  analogously  to  the  way  in  which  we  modified  the  procedure  of 
Section  4  earlier  in  this  section. 

It  will  be  noted  that  we  are  allowing  the  possibility  of  Neumann 
conditions  on  all  four  sides.  For  this,  there  will  be  a  solution  only  if 
the  boundary  conditions  satisfy  a  certain  criterion.  If  they  do,  the 
solution  is  not  unique,  but  any  two  solutions  differ  by  a  constant.  The 
procedure  outlined  will  produce  one  of  this  infinity  of  solutions  if  and 
only  if  there  is  a  solution. 

To handle the case of a Dirichlet condition on the left side and a
Neumann condition on the right side, we replace sin rx by

$\sin (r - \tfrac{1}{2}) x$,

with suitable related changes. To handle the case of a Dirichlet
condition on the right side and a Neumann condition on the left side,
we replace sin rx by

$\cos (r - \tfrac{1}{2}) x$.


We consider finally how to handle the case in which the rectangle
has a rational ratio of the sides, and we have filled it exactly with
squares of side h, and wish to approximate u(x, y) at the grid points.
At interior grid points, we can use one of the formulas of Rosser [5].
On boundaries where there are Dirichlet boundary conditions, we assign
$\bar{u}_{m,n}$ the specified value. This leaves only the boundary points
where there is a Neumann condition to be dealt with. Suppose, for example,
that the condition (6.11) holds on the left side of R. We note that

(6.17)  $h f_x(x, y) = -\frac{137}{60} f(x, y) + 5 f(x + h, y) - 5 f(x + 2h, y) + \frac{10}{3} f(x + 3h, y) - \frac{5}{4} f(x + 4h, y) + \frac{1}{5} f(x + 5h, y)$

holds to within terms of order $h^6$. If we take x = 0 and y = nh,
we get by (6.11)

(6.18)  $h j_0(nh) \cong -\frac{137}{60} \bar{u}_{0,n} + 5 \bar{u}_{1,n} - 5 \bar{u}_{2,n} + \frac{10}{3} \bar{u}_{3,n} - \frac{5}{4} \bar{u}_{4,n} + \frac{1}{5} \bar{u}_{5,n}$.
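The coefficients in (6.17) are those of the standard six-point one-sided
difference formula for a first derivative, and they can be checked
directly. The following Python sketch (the function name is our own)
applies the formula in one variable and compares the result with a known
derivative.

```python
import math

# Coefficients of f(x), f(x+h), ..., f(x+5h) in (6.17).
COEFFS = (-137 / 60, 5.0, -5.0, 10 / 3, -5 / 4, 1 / 5)


def one_sided_derivative(f, x, h):
    """Approximate f'(x) from f at x, x+h, ..., x+5h using (6.17)."""
    return sum(c * f(x + k * h) for k, c in enumerate(COEFFS)) / h


# A polynomial of degree 5 is differentiated exactly, up to rounding.
poly = lambda t: t ** 5 - 2.0 * t ** 3 + t
print(one_sided_derivative(poly, 0.3, 0.1), 5 * 0.3 ** 4 - 6 * 0.3 ** 2 + 1)

# For a general smooth function a small truncation error remains.
print(one_sided_derivative(math.exp, 0.0, 0.1), 1.0)
```

In (6.18) the same coefficients are used the other way round: the
derivative $j_0(nh)$ is known, and the relation serves as one linear
equation linking the boundary unknown to five interior unknowns.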


One could use a higher order formula than (6.17), but it probably
suffices. A heuristic argument for this is as follows. By the principle
of the maximum, if we wish to determine interior points to order $h^6$,
it is sufficient to determine the boundary points to order $h^6$. However,
if the interior points are given to order $h^6$, (6.18) will determine
$\bar{u}_{0,n}$ to order $h^6$.

Use of (6.18) with the formulas of Rosser [5] results in a rather
messy matrix of coefficients of the $\bar{u}_{m,n}$. However, one is
probably using such a coarse mesh that this matrix would be less than
100 x 100, perhaps even less than 50 x 50. If so, probably the quickest
method of solution is to use the standard computer routine for solving
simultaneous linear equations. If this is done, it does not much matter
if the matrix is messy or not.

If it happens that one is solving the Laplace equation, with
$f(x, y) \equiv 0$, and has a zero normal derivative along one side, say
$j_0(y) \equiv 0$, one can use the reflection principle to replace (6.18) by
something  which  seems  conceptually  simpler.  However,  it  involves 
three  boundary  grid  points  and  three  interior  points,  and  so  is  probably 
about  as  much  bother  on  a  computer  as  (6.18),  which  also  involves  six 
grid  points. 

If  one  has  Neumann  conditions  on  one  or  more  sides,  and  so  is 
using  (6.18),  one  might  consider  the  following  procedure,  which  would 
bypass  the  treatment  in  Section  4  altogether.  Almost  always,  there  is 
at least one side with Dirichlet conditions. By rotating and relinquishing
the qualification $a > \pi$, if need be, we can arrange to have Dirichlet
conditions on top. If, in the notation of Section 4, we have 0 < c < h,
the difficulty is that we have no good way to write down an equivalent
of (3.7) of Rosser [5] for the values of u(x, y) at the row of grid
points (mh, Nh), $1 \le m \le M - 1$. As a substitute, write down (3.7) of
Rosser  [  5]  for  the  9-point  formula  centered  at  (mh,  a  -  h).  It  involves 
values  of  u(x,  y)  at  ((m  -  l)h,  a  -  h),  ((m  -  l)h,  a  -  2h),  (mh,  a  -  h), 

(mh,  a  -  2h),  ((m  +  l)h,  a  -  h),  ((m  +  l)h,  a  -  2h),  as  well  as  at  the 

boundary  points  ((m  -  l)h,  a),  (mh,  a),  and  ((m  +  l)h,  a),  at  which 
latter  points  u(x,  y)  is  known.  Now,  by  a  high  order  one  dimensional 
interpolation  formula,  we  can  write  each  of  u(rh,  a  -  h)  and  u(rh,  a  -  2h), 
approximately as a linear combination of u(rh, nh) for $n \le N$; we do
this for r = m - 1, r = m, and r = m + 1. So we get a formula
involving u(rh, Nh), u(rh, (N - 1)h), etc., for r = m - 1, m, m + 1,
which we can use in place of (3.7) of Rosser [5]. Probably interpolation
of order eight should be used. This makes the matrix still messier, but
if we are having to deal with a messy matrix anyhow, because of the
Neumann conditions, the idea might be worth considering.

ABSTRACT. This paper presents a variational formulation which
treats initial value problems and boundary value problems in a unified
manner. The basic ingredients of this theory are (1) adjoint variables
and (2) unconstrained variations. It is an extension of the finite
element unconstrained variational formulation used previously in solving
several  nonconservative  stability  problems.  The  technique  which  makes 
this  extension  possible  is  described.  This  formulation  thus  enables 
one  to  adapt  such  numerical  technique  as  the  finite  element  method, 
which  has  had  great  success  and  popularity  for  solution  of  boundary 
value  problems,  for  solutions  of  initial  value  problems  as  well.  These 
formulations  are  given  here  for  a  forced  vibration  problem,  a  heat 
(mass)  transfer  problem  and  a  wave  propagation  problem.  Numerical 
calculations  in  conjunction  with  finite  elements  for  two  specific 
examples  are  obtained  and  compared  with  known  exact  solutions. 

1.  INTRODUCTION.  In  its  application  to  the  solutions  of  engineering 
problems,  the  finite  element  discretization  has  been  implemented  almost 
exclusively to the spatial dimensions. For dynamic or time-dependent
problems whose solutions as functions of time are of interest, a
step-by-step finite difference procedure, i.e., the quasi-static approach,
is usually employed. The answer to the question why the time dimension
has  not  been  treated  equally  with  the  spatial  variables  in  the  finite 
element  discretization  must  be  related,  in  part  at  least,  to  the 
development  of  variational  methods,  since  the  finite  element  procedure 
can  be  viewed  most  readily  as  an  extremizing  sequence  associated  with  a 
variational  statement.  While  there  are  numerous  variational  principles 
for  boundary  value  problems,  few  exist  for  initial  value  problems.  Like 
many  problems  involving  nonconservative  forces,  the  difficulty  appears  to 
be  that  initial  value  problems  are  nonself-adjoint  and  thus  they  do  not 
possess variational principles in the classical sense. In conjunction
with problems involving nonconservative forces, certain constrained
variational principles (sometimes called extended Hamilton's principles;
see, for example, ref. [1]) were used for finite element solution
formulations [2, 3]. Shortly afterwards, using the combined notion of
the Lagrange multipliers and the adjoint variable, some unconstrained
variational statements were established and used as bases for finite
element solutions [4, 5]. This approach has been shown to be more
advantageous in terms of simplicity, versatility and the rate of
convergence compared with the constrained variational approach [5, 6].

Fried was the first to treat the time dimension identically with the
space dimensions in using finite elements [7]. His solution
formulations, however, emanate from constrained variational principles.

In contrast, this paper presents a generalization of the unconstrained
variational approach to time-dependent problems.

At this point, the variational principles of integrals of convolution
developed by Gurtin [8, 9] should be mentioned. The applications
of these principles in conjunction with finite elements in the time
dimension [10, 11, 12, 13] have so far failed to show any advantage over
the procedure described by Fried. In fact, all these analyses had to
resort to either Fried's or some other similar step-by-step procedure
to complete the solutions in the time dimension.

In this paper, the use of unconstrained variational principles with
finite elements for usual boundary value problems is first illustrated
and the advantages over the constrained formulations are pointed out.

The unconstrained variational principles can always be constructed
through the use of the Lagrange multipliers. The unconstrained
variations are then shown to lead naturally to (nonself-)adjoint variational
statements. Thus, nonconservative problems can be formulated easily
using finite elements. The application to a control problem is given
in [14]. With the introduction of a cross-product term involving two-point
boundary (initial) values, the unconstrained variational finite element
formulation is again easily extended to include time-dependent problems.
This formulation is simpler than those derived from
Gurtin's variational principles because no convolution integrals are
needed. It is also easier to use and more versatile than Fried's
procedure, because no boundary or initial conditions are
involved in the solution formulation and because of the nature of the
Lagrange multipliers. As further examples of application, finite
element matrix equations are derived for several transient problems
including a forced vibration, a heat transfer and a wave propagation
problem. Detailed formulations and numerical results of two examples
are given and comparisons with some known exact solutions are made.

2.  LAGRANGE  MULTIPLIER  AND  FINITE  ELEMENT  FORMULATIONS.  One  of 
the  advantages  of  the  finite  element  method  is  its  capability  of  solving 
large complicated problems in a routine manner. However, the same
concepts used in a program for large systems may be understood using
relatively simple problems.

Let us consider the stability of an Euler column. The governing
equations are as follows:

D.E.
$$ EI\,u'''' + P\,u'' + \omega^2 \rho A\,u = 0 \tag{1a} $$

B.C.
$$ u(0) = u'(0) = 0 \tag{1b, 1c} $$
$$ u''(\ell) = 0 \tag{1d} $$
$$ EI\,u'''(\ell) + P\,u'(\ell) = 0 \tag{1e} $$

where u is the lateral displacement; a prime (') denotes differentiation
with respect to the coordinate x; E is Young's modulus; $\rho$, the density
of the material; I, the second moment and A, the area of the cross-section;
$\ell$, the length of the beam; and $\omega$ is the eigenvalue. For eqs. (1), a
usual variational principle can be written:

$$ \delta J_1(u) = 0 \tag{2a} $$

where

$$ J_1(u) = \frac{1}{2} \int_0^{\ell} \left[ EI\,(u'')^2 - P\,(u')^2 + \omega^2 \rho A\,u^2 \right] dx \tag{2b} $$


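To establish the equivalence between eqs. (1) and (2), one simply
carries out the variation of $J_1$ in eq. (2a):

$$ \delta J_1 = \int_0^{\ell} \left[ EI\,u''\,\delta u'' - P\,u'\,\delta u' + \omega^2 \rho A\,u\,\delta u \right] dx \tag{3a} $$

$$ = \int_0^{\ell} \left[ EI\,u'''' + P\,u'' + \omega^2 \rho A\,u \right] \delta u\,dx + \left[ EI\,u''\,\delta u' - (EI\,u''' + P\,u')\,\delta u \right]_{x=\ell} - \left[ EI\,u''\,\delta u' - (EI\,u''' + P\,u')\,\delta u \right]_{x=0} \tag{3b} $$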


From  eq.  (3b)  one  observes  that  for  the  coordinate  functions  and  their 
variations  satisfying  the  boundary  conditions  in  eqs.  (lb  -  le),  eq. 
(la)  implies  eq.  (2a)  and  vice  versa.  The  finite  element  formulation 
for  this  problem  begins  with  eq.  (3a). 


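Let

$$ u(x) = \mathbf{a}^T(x)\,\mathbf{U} \tag{4} $$

where $\mathbf{a}(x)$ is the displacement-function vector and $\mathbf{U}$, the generalized
displacement vector. Upon the substitution of eq. (4) into eq. (3a),
one immediately obtains

$$ \delta \mathbf{U}^T \left\{ \mathbf{K}_1 + \omega^2 \mathbf{M} \right\} \mathbf{U} = 0 \tag{5} $$

where

$$ \mathbf{K}_1 = \int_0^{\ell} \left[ EI\,\mathbf{a}''\,\mathbf{a}''^T - P\,\mathbf{a}'\,\mathbf{a}'^T \right] dx \tag{6a} $$

$$ \mathbf{M} = \int_0^{\ell} \rho A\,\mathbf{a}\,\mathbf{a}^T\,dx \tag{6b} $$

Eq. (5) is not yet ready to be solved since neither $\mathbf{U}$ nor $\delta \mathbf{U}$
consists of independent elements due to the boundary condition
requirements placed on u(x).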

Let  us  now  consider  a  slightly  different  variational  principle: 

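$$ \delta J_2 = 0 \tag{7a} $$

with

$$ J_2 = \frac{1}{2} \int_0^{\ell} \left[ EI\,(u'')^2 - P\,(u')^2 + \omega^2 \rho A\,u^2 \right] dx + \frac{1}{2} \alpha_1 [u(0)]^2 + \frac{1}{2} \alpha_2 [u'(0)]^2 \tag{7b} $$

where $\alpha_1$ and $\alpha_2$ are the Lagrange multipliers.

Carrying out the variation of eqs. (7), we have

$$ \delta J_2 = \int_0^{\ell} \left[ EI\,u''\,\delta u'' - P\,u'\,\delta u' + \omega^2 \rho A\,u\,\delta u \right] dx + \alpha_1 u(0)\,\delta u(0) + \alpha_2 u'(0)\,\delta u'(0) \tag{8a} $$

$$ = \int_0^{\ell} \left[ EI\,u'''' + P\,u'' + \omega^2 \rho A\,u \right] \delta u\,dx + \left[ EI\,u''\,\delta u' - (EI\,u''' + P\,u')\,\delta u \right]_{x=\ell} - \left[ (EI\,u'' - \alpha_2 u')\,\delta u' - (EI\,u''' + P\,u' + \alpha_1 u)\,\delta u \right]_{x=0} \tag{8b} $$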

Eq. (8b) states that a necessary and sufficient condition for
$\delta J_2 = 0$ is the problem defined by the following set of equations:

$$ EI\,u'''' + P\,u'' + \omega^2 \rho A\,u = 0 \tag{9a} $$
$$ EI\,u''(0) - \alpha_2 u'(0) = 0 \tag{9b} $$
$$ EI\,u'''(0) + P\,u'(0) + \alpha_1 u(0) = 0 \tag{9c} $$
$$ EI\,u''(\ell) = 0 \tag{9d} $$
$$ EI\,u'''(\ell) + P\,u'(\ell) = 0 \tag{9e} $$

provided that the variation $\delta u$ is completely arbitrary. Comparing
eqs. (9) and (1), it is seen that eqs. (1) are a special case of (9) as
$\alpha_1$, $\alpha_2$ approach infinity. From eq. (8a), we can see that the finite
element matrix equation now becomes

$$ \delta \mathbf{U}^T \left\{ \mathbf{K}_2 + \omega^2 \mathbf{M} \right\} \mathbf{U} = 0 \tag{10} $$

where

$$ \mathbf{K}_2 = \mathbf{K}_1 + \alpha_1\,\mathbf{a}(0)\,\mathbf{a}^T(0) + \alpha_2\,\mathbf{a}'(0)\,\mathbf{a}'^T(0) \tag{11} $$

The matrix $\mathbf{K}_1$ in eq. (11) has been defined in eq. (6a) and the
superscript T denotes the transpose of a matrix (a vector). Since $\delta u$ is
arbitrary, $\delta \mathbf{U}$ in eq. (10) is arbitrary, and eq. (10) leads directly to the
final matrix equation to be solved:

$$ \left\{ \mathbf{K}_2 + \omega^2 \mathbf{M} \right\} \mathbf{U} = 0 \tag{12} $$

It  is  then  clear  that  the  method  of  Lagrange  multipliers,  used  in 
conjunction  with  the  finite  element  method,  will  not  only  facilitate 
the  solution  formulations  but  also  encompass  a  larger  class  of  problems 
to  be  solved  compared  with  the  use  of  constrained  variational  statements. 
The  applications  of  the  same  general  concept  can  be  extended  further. 
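
The limiting argument behind the statement that eqs. (1) are a special case of eqs. (9) can be written out explicitly. The short derivation below is a sketch that assumes $u''(0)$ and $u'''(0)$ remain bounded as the multipliers grow; that boundedness is an assumption made here, not a statement from the paper.

```latex
% Solve the two boundary relations at x = 0 for the quantities to be constrained:
\begin{align*}
  \text{(9b)}:\quad & u'(0) = \frac{EI\,u''(0)}{\alpha_2}, \\
  \text{(9c)}:\quad & u(0)  = -\,\frac{EI\,u'''(0) + P\,u'(0)}{\alpha_1}.
\end{align*}
% With bounded numerators, letting the multipliers grow gives
\begin{align*}
  \alpha_2 \to \infty \;\Rightarrow\; u'(0) \to 0 \quad \text{(1c)}, \qquad
  \alpha_1 \to \infty \;\Rightarrow\; u(0) \to 0 \quad \text{(1b)}.
\end{align*}
```

In a computation the multipliers are large but finite, so the essential conditions (1b) and (1c) are met only to within terms of order $1/\alpha_1$ and $1/\alpha_2$.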

3.  FROM  UNCONSTRAINED  VARIATIONS  TO  ADJOINT  VARIATIONAL  STATEMENTS. 

We  have  noted  that  the  variation  6u  in  eq.  (8)  is  quite  independent 
of  the  function  u  itself  and  nothing  will  be  changed  if  we  simply 
replace  6u  with  6v  to  emphasize  this  independence.  This  substitution, 
however,  has  suggested  the  adjoint  variational  principles.  Let  us 
consider 

$$ \delta J_3 = 0 \tag{13a} $$

$$ J_3 = \int_0^{\ell} \left( EI\,u''v'' - P\,u'v' + \omega^2 \rho A\,uv \right) dx + \alpha_1 u(0)\,v(0) + \alpha_2 u'(0)\,v'(0) + \alpha_3 u'(\ell)\,v(\ell) \tag{13b} $$

Carrying out the variations, we have:

$$ \delta J_3 = (\delta J_3)_u + (\delta J_3)_v \tag{14} $$

where

$$ (\delta J_3)_u = \int_0^{\ell} \left( EI\,u''\,\delta v'' - P\,u'\,\delta v' + \omega^2 \rho A\,u\,\delta v \right) dx + \alpha_1 u(0)\,\delta v(0) + \alpha_2 u'(0)\,\delta v'(0) + \alpha_3 u'(\ell)\,\delta v(\ell) \tag{15a} $$

$$ = \int_0^{\ell} \left( EI\,u'''' + P\,u'' + \omega^2 \rho A\,u \right) \delta v\,dx + \left[ EI\,u''\,\delta v' - (EI\,u''' + P\,u' - \alpha_3 u')\,\delta v \right]_{x=\ell} - \left[ (EI\,u'' - \alpha_2 u')\,\delta v' - (EI\,u''' + P\,u' + \alpha_1 u)\,\delta v \right]_{x=0} \tag{15b} $$


$$ EI\,u'''(0) + P\,u'(0) + \alpha_1 u(0) = 0 \tag{17e} $$

Now eqs. (9) have become a special case of eqs. (17) when $\alpha_3 = 0$.

In addition, the problem defined by $(\delta J_3)_v = 0$ of eqs. (16) is called
the adjoint problem to eqs. (17). For $\alpha_3 = 0$, the adjoint problem is
identical to the problem itself; hence, the self-adjoint system. Now,
considering

$$ \alpha_3 = k P \tag{18} $$

in eq. (17c), we have

$$ EI\,u'''(\ell) - \kappa P\,u'(\ell) = 0 \tag{19} $$

$$ \kappa = k - 1 \tag{20} $$

Eq. (19) defines the boundary condition of a general nonconservative
load. It is also clear from eq. (19) that $\kappa$ is a dimensionless
design constant which defines the small angle between the direction of
the applied load P and the tangent of the deflected column at the end.
Since $(\delta J_3)_u = 0$ alone defines the boundary value problem of eqs. (17)
and vice versa, we need not be concerned at all with the adjoint
problem. Now it is a simple matter to modify the finite element matrix
equation as

$$ \left\{ \mathbf{K}_3 + \omega^2 \mathbf{M} \right\} \mathbf{U} = 0 \tag{22} $$

where

$$ \mathbf{K}_3 = \mathbf{K}_2 + \alpha_3\,\mathbf{a}(\ell)\,\mathbf{a}'^T(\ell) \tag{23} $$

4. FINITE ELEMENTS FOR INITIAL AND INITIAL-BOUNDARY VALUE PROBLEMS.

(1) A Forced Vibration Problem. Let us first consider a problem
of "one" degree of freedom, i.e., a mass-spring system. The differential
equation and initial conditions are

$$ m\,\ddot{u} + k\,u = f(t), \qquad 0 < t < T \tag{24a} $$

$$ u(0) = u_0 \tag{24b} $$

$$ \dot{u}(0) = u_1 \tag{24c} $$

where u(t) is the displacement of the mass centre from its equilibrium
position, m, the amount of mass and k, the spring constant.
The function f(t) is given, and so are the constants $u_0$ and $u_1$. The constant
T appearing in the bounds of eq. (24a) is any given finite positive
number. In order to formulate approximate solutions for
eqs. (24) the way we did in the previous section, let us consider a more
general case

$$ m\,\ddot{u} + k\,u = f(t) \tag{25a} $$

$$ \dot{u}(T) - \alpha\,[u(0) - u_0] = 0 \tag{25b} $$

$$ \dot{u}(0) = u_1 \tag{25c} $$

where $\alpha$ is a parameter. Obviously eqs. (25) reduce to (24) when
$\alpha$ approaches $\infty$. Now, with eqs. (25), we are able to write an
unconstrained variational statement as follows:

$$ \delta J_4 = 0 \tag{26a} $$

where

$$ J_4 = \int_0^T \left[ -m\,\dot{u}\,\dot{v} + k\,uv - f(t)\,v \right] dt + m\alpha\,[u(0) - u_0]\,v(T) - m\,u_1\,v(0) \tag{26b} $$

Since

$$ (\delta J_4)_u = \int_0^T \left[ -m\,\dot{u}\,\delta\dot{v} + k\,u\,\delta v - f(t)\,\delta v \right] dt + m\alpha\,[u(0) - u_0]\,\delta v(T) - m\,u_1\,\delta v(0) \tag{27a} $$

$$ = \int_0^T \left[ m\,\ddot{u} + k\,u - f(t) \right] \delta v\,dt - m \left\{ \dot{u}(T) - \alpha\,[u(0) - u_0] \right\} \delta v(T) + m\,[\dot{u}(0) - u_1]\,\delta v(0) \tag{27b} $$

the already familiar form of eqs. (27) states that (a) $(\delta J_4)_u = 0$ is a
necessary and sufficient condition for eqs. (25), and (b) eq. (27a)
provides us the finite element matrix equation. Thus if we assume as
before that

$$ u(t) = \mathbf{a}^T(t)\,\mathbf{U}, \qquad v(t) = \mathbf{a}^T(t)\,\mathbf{V} $$

eq. (27a) yields

$$ \delta \mathbf{V}^T \mathbf{K}_4\,\mathbf{U} = \delta \mathbf{V}^T \mathbf{F} \tag{28} $$

where

$$ \mathbf{K}_4 = \int_0^T \left( -m\,\dot{\mathbf{a}}\,\dot{\mathbf{a}}^T + k\,\mathbf{a}\,\mathbf{a}^T \right) dt + m\alpha\,\mathbf{a}(T)\,\mathbf{a}^T(0) \tag{29} $$

and

$$ \mathbf{F} = \int_0^T f(t)\,\mathbf{a}\,dt + m\alpha\,u_0\,\mathbf{a}(T) + m\,u_1\,\mathbf{a}(0) \tag{30} $$

Again, since $\delta \mathbf{V}$ is unconstrained, eq. (28) leads directly to

$$ \mathbf{K}_4\,\mathbf{U} = \mathbf{F} \tag{31} $$

which is the final equation to be solved.

(2) A Heat Conduction Problem. The one dimensional transient
heat conduction problem can be described by the equation


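$$ \frac{\partial}{\partial x} \left( K \frac{\partial u}{\partial x} \right) - \rho c\,\frac{\partial u}{\partial t} - f(x,t) = 0 \tag{32a} $$

Boundary and initial conditions are

$$ u(0,t) = g_0(t) \tag{32b} $$
$$ u(L,t) = g_1(t) \tag{32c} $$
$$ u(x,0) = h(x) \tag{32d} $$

where

K = thermal conductivity
$\rho$ = material density
c = specific heat
f(x,t) = heat source function

and $g_0(t)$, $g_1(t)$ and $h(x)$ are prescribed functions.

Let us consider

$$ \delta J_5 = 0 \tag{33a} $$

with

$$ J_5 = -\int_0^L\!\!\int_0^T \left[ K \frac{\partial u}{\partial x} \frac{\partial v}{\partial x} + \rho c\,\frac{\partial u}{\partial t}\,v + f(x,t)\,v \right] dt\,dx + \int_0^T \alpha K\,[u(L,t) - g_1(t)]\,v(L,t)\,dt - \int_0^T \alpha K\,[u(0,t) - g_0(t)]\,v(0,t)\,dt + \int_0^L \rho c\,[u(x,0) - h(x)]\,v(x,0)\,dx \tag{33b} $$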

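Since

$$ (\delta J_5)_u = -\int_0^L\!\!\int_0^T \left[ K \frac{\partial u}{\partial x}\,\delta\!\left( \frac{\partial v}{\partial x} \right) + \rho c\,\frac{\partial u}{\partial t}\,\delta v + f(x,t)\,\delta v \right] dx\,dt + \int_0^T \alpha K\,[u(L,t) - g_1(t)]\,\delta v(L,t)\,dt - \int_0^T \alpha K\,[u(0,t) - g_0(t)]\,\delta v(0,t)\,dt + \int_0^L \rho c\,[u(x,0) - h(x)]\,\delta v(x,0)\,dx \tag{34a} $$

$$ = \int_0^L\!\!\int_0^T \left[ \frac{\partial}{\partial x} \left( K \frac{\partial u}{\partial x} \right) - \rho c\,\frac{\partial u}{\partial t} - f(x,t) \right] \delta v\,dx\,dt - \int_0^T K \left\{ \frac{\partial u}{\partial x}(L,t) - \alpha\,[u(L,t) - g_1(t)] \right\} \delta v(L,t)\,dt + \int_0^T K \left\{ \frac{\partial u}{\partial x}(0,t) - \alpha\,[u(0,t) - g_0(t)] \right\} \delta v(0,t)\,dt + \int_0^L \rho c\,[u(x,0) - h(x)]\,\delta v(x,0)\,dx \tag{34b} $$

it is clear that $(\delta J_5)_u = 0$ is a necessary and sufficient condition
for eqs. (32) as $\alpha \to \infty$ and eq. (34a) provides the finite element
matrix equation. We can write from eq. (34a),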

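$$ -\int_0^L\!\!\int_0^T \left[ K \frac{\partial u}{\partial x}\,\delta\!\left( \frac{\partial v}{\partial x} \right) + \rho c\,\frac{\partial u}{\partial t}\,\delta v \right] dx\,dt + \int_0^T \alpha K\,[u(L,t)\,\delta v(L,t) - u(0,t)\,\delta v(0,t)]\,dt + \int_0^L \rho c\,u(x,0)\,\delta v(x,0)\,dx $$
$$ = \int_0^L\!\!\int_0^T f(x,t)\,\delta v\,dx\,dt + \int_0^T \alpha K\,[g_1(t)\,\delta v(L,t) - g_0(t)\,\delta v(0,t)]\,dt + \int_0^L \rho c\,h(x)\,\delta v(x,0)\,dx \tag{35} $$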

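Now, let

$$ u(x,t) = \mathbf{a}^T(x,t)\,\mathbf{U} \tag{36a} $$
$$ v(x,t) = \mathbf{a}^T(x,t)\,\mathbf{V} \tag{36b} $$

in the usual manner. We have

$$ \delta \mathbf{V}^T \mathbf{K}\,\mathbf{U} = \delta \mathbf{V}^T \mathbf{F} \tag{37} $$

where

$$ \mathbf{K} = -\int_0^L\!\!\int_0^T \left( K\,\frac{\partial \mathbf{a}}{\partial x} \frac{\partial \mathbf{a}^T}{\partial x} + \rho c\,\mathbf{a}\,\frac{\partial \mathbf{a}^T}{\partial t} \right) dx\,dt + \int_0^T \alpha K \left[ \mathbf{a}(L,t)\,\mathbf{a}^T(L,t) - \mathbf{a}(0,t)\,\mathbf{a}^T(0,t) \right] dt + \int_0^L \rho c\,\mathbf{a}(x,0)\,\mathbf{a}^T(x,0)\,dx \tag{38} $$

and

$$ \mathbf{F} = \int_0^L\!\!\int_0^T f(x,t)\,\mathbf{a}(x,t)\,dx\,dt + \int_0^T \alpha K \left[ g_1(t)\,\mathbf{a}(L,t) - g_0(t)\,\mathbf{a}(0,t) \right] dt + \int_0^L \rho c\,h(x)\,\mathbf{a}(x,0)\,dx \tag{39} $$

Again, since $\delta \mathbf{V}$ in eq. (37) is completely arbitrary, we arrive at the
final matrix equation to be solved.

$$ \mathbf{K}\,\mathbf{U} = \mathbf{F} \tag{40} $$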


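(3) A Wave Propagation Problem. For a quite general wave propagation
problem, the following system can be written:

$$ \frac{\partial^2 u}{\partial t^2} - c^2\,\frac{\partial^2 u}{\partial x^2} = f(x,t) \tag{41a} $$
$$ u(0,t) = g_0(t) \tag{41b} $$
$$ u(L,t) = g_1(t) \tag{41c} $$
$$ u(x,0) = h_0(x) \tag{41d} $$
$$ \frac{\partial u}{\partial t}(x,0) = h_1(x) \tag{41e} $$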

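The extension of the previous formulation to this problem is
straightforward. Let us consider

$$ \delta J_6 = 0 \tag{42a} $$

where

$$ J_6 = \int_0^L\!\!\int_0^T \left[ -\frac{\partial u}{\partial t} \frac{\partial v}{\partial t} + c^2\,\frac{\partial u}{\partial x} \frac{\partial v}{\partial x} - f(x,t)\,v \right] dx\,dt - \alpha \int_0^T [u(L,t) - g_1(t)]\,v(L,t)\,dt + \alpha \int_0^T [u(0,t) - g_0(t)]\,v(0,t)\,dt - \alpha \int_0^L [u(x,0) - h_0(x)]\,v(x,T)\,dx - \int_0^L h_1(x)\,v(x,0)\,dx \tag{42b} $$

Again,

$$ (\delta J_6)_u = \int_0^L\!\!\int_0^T \left[ -\frac{\partial u}{\partial t}\,\delta\!\left( \frac{\partial v}{\partial t} \right) + c^2\,\frac{\partial u}{\partial x}\,\delta\!\left( \frac{\partial v}{\partial x} \right) - f(x,t)\,\delta v \right] dx\,dt - \alpha \int_0^T [u(L,t) - g_1(t)]\,\delta v(L,t)\,dt + \alpha \int_0^T [u(0,t) - g_0(t)]\,\delta v(0,t)\,dt - \alpha \int_0^L [u(x,0) - h_0(x)]\,\delta v(x,T)\,dx - \int_0^L h_1(x)\,\delta v(x,0)\,dx \tag{43a} $$

$$ = \int_0^L\!\!\int_0^T \left[ \frac{\partial^2 u}{\partial t^2} - c^2\,\frac{\partial^2 u}{\partial x^2} - f(x,t) \right] \delta v\,dx\,dt + \int_0^T \left\{ c^2\,\frac{\partial u}{\partial x}(L,t) - \alpha\,[u(L,t) - g_1(t)] \right\} \delta v(L,t)\,dt - \int_0^T \left\{ c^2\,\frac{\partial u}{\partial x}(0,t) - \alpha\,[u(0,t) - g_0(t)] \right\} \delta v(0,t)\,dt - \int_0^L \left\{ \frac{\partial u}{\partial t}(x,T) + \alpha\,[u(x,0) - h_0(x)] \right\} \delta v(x,T)\,dx + \int_0^L \left[ \frac{\partial u}{\partial t}(x,0) - h_1(x) \right] \delta v(x,0)\,dx \tag{43b} $$

From eqs. (43), it is again clear that $(\delta J_6)_u = 0$ is a necessary and
sufficient condition for eqs. (41) as $\alpha \to \infty$ and that eq. (43a) will
yield the finite element matrix equation. From (43a) one has:

$$ \int_0^L\!\!\int_0^T \left[ -\frac{\partial u}{\partial t}\,\delta\!\left( \frac{\partial v}{\partial t} \right) + c^2\,\frac{\partial u}{\partial x}\,\delta\!\left( \frac{\partial v}{\partial x} \right) \right] dx\,dt - \alpha \int_0^T u(L,t)\,\delta v(L,t)\,dt + \alpha \int_0^T u(0,t)\,\delta v(0,t)\,dt - \alpha \int_0^L u(x,0)\,\delta v(x,T)\,dx $$
$$ = \int_0^L\!\!\int_0^T f(x,t)\,\delta v(x,t)\,dx\,dt - \alpha \int_0^T g_1(t)\,\delta v(L,t)\,dt + \alpha \int_0^T g_0(t)\,\delta v(0,t)\,dt - \alpha \int_0^L h_0(x)\,\delta v(x,T)\,dx + \int_0^L h_1(x)\,\delta v(x,0)\,dx \tag{44} $$

Again, let

$$ u(x,t) = \mathbf{a}^T(x,t)\,\mathbf{U} \tag{45a} $$
$$ v(x,t) = \mathbf{a}^T(x,t)\,\mathbf{V} \tag{45b} $$

Eq. (44) becomes, in matrix form,

$$ \delta \mathbf{V}^T \mathbf{K}\,\mathbf{U} = \delta \mathbf{V}^T \mathbf{F} \tag{46} $$

where

$$ \mathbf{K} = \int_0^L\!\!\int_0^T \left( -\frac{\partial \mathbf{a}}{\partial t} \frac{\partial \mathbf{a}^T}{\partial t} + c^2\,\frac{\partial \mathbf{a}}{\partial x} \frac{\partial \mathbf{a}^T}{\partial x} \right) dx\,dt - \alpha \int_0^T \mathbf{a}(L,t)\,\mathbf{a}^T(L,t)\,dt + \alpha \int_0^T \mathbf{a}(0,t)\,\mathbf{a}^T(0,t)\,dt - \alpha \int_0^L \mathbf{a}(x,T)\,\mathbf{a}^T(x,0)\,dx \tag{47} $$

$$ \mathbf{F} = \int_0^L\!\!\int_0^T f(x,t)\,\mathbf{a}(x,t)\,dx\,dt - \alpha \int_0^T g_1(t)\,\mathbf{a}(L,t)\,dt + \alpha \int_0^T g_0(t)\,\mathbf{a}(0,t)\,dt - \alpha \int_0^L h_0(x)\,\mathbf{a}(x,T)\,dx + \int_0^L h_1(x)\,\mathbf{a}(x,0)\,dx \tag{48} $$

Due to the arbitrariness of $\delta \mathbf{V}$, eq. (46) leads directly to the final
matrix equation

$$ \mathbf{K}\,\mathbf{U} = \mathbf{F} \tag{49} $$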

5.  NUMERICAL  DEMONSTRATIONS.  Several  numerical  examples  will  be 
given  in  this  section  to  demonstrate  the  application  of  the  formulation 
described  so  far. 

(1) Forced Vibration. We shall consider a special case of the
forced vibration problem formulated earlier. The forcing function in
eqs. (24) is taken to be a cosine function; thus, we rewrite eqs. (24) as

$$ m\,\ddot{u} + k\,u = f_0 \cos \omega_f t \tag{50a} $$
$$ u(0) = u_0 \tag{50b} $$
$$ \dot{u}(0) = u_1 \tag{50c} $$

where $u_0$, $u_1$, $f_0$ and $\omega_f$ are given constants. In the finite element
formulation, we shall replace eqs. (50) with the following set

$$ m\,\ddot{u} + k\,u = f_0 \cos \omega_f t \tag{51a} $$
$$ \dot{u}(T) - \alpha\,[u(0) - u_0] = 0 \tag{51b} $$
$$ \dot{u}(0) - u_1 = 0 \tag{51c} $$

thus, eqs. (50) become a special case of (51) as $\alpha \to \infty$. It is
convenient to nondimensionalize the independent variable t and let

$$ \tau = t/T \tag{52} $$

In terms of $\tau$, with the dot now denoting differentiation with respect
to $\tau$, eqs. (51) become

$$ \ddot{u} + T^2 \omega^2 u = T^2 f_1 \cos (T \omega_f \tau) \tag{53a} $$
$$ \dot{u}(1) - T \alpha\,[u(0) - u_0] = 0 \tag{53b} $$
$$ \dot{u}(0) - T u_1 = 0 \tag{53c} $$

where

$$ f_1 = f_0/m, \qquad \omega^2 = k/m $$

The exact solution for eqs. (53) can be easily written as

$$ u(\tau) = A \cos (T \omega \tau) + B \sin (T \omega \tau) + \eta \cos (T \omega_f \tau) \tag{54} $$

with

$$ \eta = \frac{f_0}{m\,(\omega^2 - \omega_f^2)}, \qquad B = \frac{u_1}{\omega} \tag{55} $$

$$ A = \frac{\alpha\,u_0 + T u_1 \cos (T \omega) - \eta\,[\alpha + T \omega_f \sin (T \omega_f)]}{\alpha + T \omega \sin (T \omega)} \tag{56} $$
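
The closed form (54)-(55) is easy to evaluate numerically. The sketch below is illustrative only: the function name and argument names are assumptions introduced here. It works in the original time variable t and takes the limit $\alpha \to \infty$ of eq. (56), in which $A = u_0 - \eta$, so that the initial conditions (50b) and (50c) hold exactly. The default arguments are the parameter values quoted for the numerical demonstrations.

```python
import math

def exact_forced_vibration(t, k=1.0, m=1.0, f0=1.0, wf=0.5, u0=1.0, u1=1.0):
    """Return (u, du/dt) at time t for m*u'' + k*u = f0*cos(wf*t), u(0)=u0, u'(0)=u1."""
    w = math.sqrt(k / m)
    eta = f0 / (m * (w * w - wf * wf))   # particular-solution amplitude, eq. (55)
    A = u0 - eta                         # limit of eq. (56) as alpha -> infinity
    B = u1 / w                           # from the initial velocity, eq. (55)
    u = A * math.cos(w * t) + B * math.sin(w * t) + eta * math.cos(wf * t)
    du = (-A * w * math.sin(w * t) + B * w * math.cos(w * t)
          - eta * wf * math.sin(wf * t))
    return u, du

for t in (0.0, 1.0, 2.0, 5.0):
    u, du = exact_forced_vibration(t)
    print(f"t = {t:4.1f}   u = {u:7.3f}   du/dt = {du:7.3f}")
```

With these defaults the values at t = 1, 2 and 5 round to the parenthesized exact entries of Table 2, for example u(1) = 1.831 and du/dt(1) = 0.501.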


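To solve eqs. (53) using finite elements, one begins with the variational
statement:

$$ \delta J = 0 \tag{57a} $$

$$ J = \int_0^1 \left[ -\dot{u}\,\dot{v} + T^2 \omega^2 uv - f(\tau)\,v \right] d\tau + T \alpha\,[u(0) - u_0]\,v(1) - T u_1\,v(0) \tag{57b} $$

where $f(\tau)$ denotes the right-hand side of eq. (53a). Now

$$ (\delta J)_u = 0 \tag{58a} $$

with

$$ (\delta J)_u = \int_0^1 \left[ -\dot{u}\,\delta\dot{v} + T^2 \omega^2 u\,\delta v - f(\tau)\,\delta v \right] d\tau + T \alpha\,[u(0) - u_0]\,\delta v(1) - T u_1\,\delta v(0) \tag{58b} $$

$$ = \int_0^1 \left[ \ddot{u} + T^2 \omega^2 u - f(\tau) \right] \delta v\,d\tau - \left\{ \dot{u}(1) - \alpha T\,[u(0) - u_0] \right\} \delta v(1) + \left\{ \dot{u}(0) - T u_1 \right\} \delta v(0) \tag{58c} $$

From eq. (58b), one has

$$ \int_0^1 \left[ -\dot{u}\,\delta\dot{v} + T^2 \omega^2 u\,\delta v \right] d\tau + \alpha T\,u(0)\,\delta v(1) = \int_0^1 f(\tau)\,\delta v\,d\tau + \alpha T\,u_0\,\delta v(1) + T u_1\,\delta v(0) \tag{59} $$

With

$$ u(\tau) = \mathbf{a}^T(\tau)\,\mathbf{U}, \qquad v(\tau) = \mathbf{a}^T(\tau)\,\mathbf{V} \tag{60} $$

eq. (59) leads to

$$ \delta \mathbf{V}^T \mathbf{K}\,\mathbf{U} = \delta \mathbf{V}^T \mathbf{F} $$

or

$$ \mathbf{K}\,\mathbf{U} = \mathbf{F} \tag{61} $$

where

$$ \mathbf{K} = \int_0^1 \left( -\dot{\mathbf{a}}\,\dot{\mathbf{a}}^T + T^2 \omega^2\,\mathbf{a}\,\mathbf{a}^T \right) d\tau + \alpha T\,\mathbf{a}(1)\,\mathbf{a}^T(0) \tag{62} $$

$$ \mathbf{F} = \int_0^1 f(\tau)\,\mathbf{a}\,d\tau + \alpha T\,u_0\,\mathbf{a}(1) + T u_1\,\mathbf{a}(0) \tag{63} $$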

The results obtained from this finite element formulation are
compared with the exact solutions as shown in Tables 1-3. The values of
the parameters chosen for these data are k = 1.0, m = 1.0, $f_0$ = 1.0,
$\omega_f$ = 0.5, $u_0$ = 1.0, $u_1$ = 1.0; the number of elements used is ten. The
calculated $u$ and $\dot{u}$ for T = 2.0, 10.0 and 20.0 are given in Tables 1,
2 and 3 respectively. The forcing function $\cos \omega_f t$ and the
solution u(t) are also plotted in the range 0 < t < 20 as shown in Figure 1.
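
The assembly of $\mathbf{K}$ and $\mathbf{F}$ for the forced vibration problem can be sketched in a few lines. The element type used for the tables is not stated in the text, so the code below assumes two-node linear elements in $\tau$, Simpson's rule for the load integral, a finite multiplier `alpha = 1e8`, and a plain dense Gaussian elimination; all function and variable names are hypothetical. It is an illustration of the formulation under those assumptions, not a reproduction of the computation behind Tables 1-3.

```python
import math

def solve(A, b):
    """Dense Gaussian elimination with partial pivoting (works on copies)."""
    n = len(b)
    A = [row[:] for row in A]
    b = b[:]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(A[r][c]))
        A[c], A[p] = A[p], A[c]
        b[c], b[p] = b[p], b[c]
        for r in range(c + 1, n):
            mult = A[r][c] / A[c][c]
            if mult != 0.0:
                for j in range(c, n):
                    A[r][j] -= mult * A[c][j]
                b[r] -= mult * b[c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = b[r] - sum(A[r][j] * x[j] for j in range(r + 1, n))
        x[r] = s / A[r][r]
    return x

def fe_uvf(T, n_el, k=1.0, m=1.0, f0=1.0, wf=0.5, u0=1.0, u1=1.0, alpha=1e8):
    """Nodal values of u on tau = 0, 1/n_el, ..., 1 from K U = F."""
    w2 = k / m
    h = 1.0 / n_el
    n = n_el + 1
    K = [[0.0] * n for _ in range(n)]
    F = [0.0] * n

    def f(tau):
        # Right-hand side of the nondimensional equation of motion.
        return T * T * (f0 / m) * math.cos(T * wf * tau)

    s = 1.0 / h                    # from the integral of (da/dtau)(da/dtau)^T
    c = T * T * w2 * h / 6.0       # from the integral of a a^T
    ke = [[-s + 2.0 * c, s + c], [s + c, -s + 2.0 * c]]
    for e in range(n_el):
        left = e * h
        fl, fm, fr = f(left), f(left + h / 2.0), f(left + h)
        for i in range(2):
            for j in range(2):
                K[e + i][e + j] += ke[i][j]
        F[e] += h / 6.0 * (fl + 2.0 * fm)        # Simpson's rule, left hat
        F[e + 1] += h / 6.0 * (2.0 * fm + fr)    # Simpson's rule, right hat
    K[n - 1][0] += alpha * T       # alpha T a(1) a^T(0): row of v(1), column of u(0)
    F[n - 1] += alpha * T * u0     # alpha T u0 a(1)
    F[0] += T * u1                 # T u1 a(0)
    return solve(K, F)

U = fe_uvf(T=2.0, n_el=200)
print(U[0], U[-1])   # approximations to u at t = 0 and t = T
```

Note that the multiplier term couples the test function at $\tau = 1$ with the unknown at $\tau = 0$, which makes $\mathbf{K}$ nonsymmetric; this is why a general solver rather than a symmetric one is used in the sketch.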


(2) Solutions to a Transient Heat Conduction Problem. As another
numerical example, we shall take the nondimensional heat transfer problem
defined by the following set:

TABLE 1


Solutions to the Forced Vibration Problem Using FE-UVF
Compared with Exact Solutions (in Parentheses)

0 < t < 2.0

TABLE 2


Solutions to the Forced Vibration Problem Using FE-UVF
Compared with Exact Solutions (in Parentheses)

0 < t < 10.0


    t        u(t)                 du(t)/dt

    0       1.000  (1.000)       1.004  (1.000)
    1.0     1.832  (1.831)       0.505  (0.501)
    2.0     1.770  (1.768)      -0.675 (-0.674)
    3.0     0.566  (0.565)      -1.614 (-1.608)
    4.0    -1.094 (-1.094)      -1.518 (-1.512)
    5.0    -2.123 (-2.122)      -0.435 (-0.435)
    6.0    -1.920 (-1.919)       0.778  (0.773)
    7.0    -0.843 (-0.843)       1.213  (1.207)
    8.0     0.167  (0.166)       0.690  (0.689)
    9.0     0.436  (0.435)      -0.126 (-0.122)
   10.0     0.114  (0.114)      -0.385 (-0.381)

TABLE 3

Solution to the Forced Vibration Problem Using FE-UVF
Compared with Exact Solutions (in Parentheses)

0 < t < 20.0

D.E.:


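$$ \frac{\partial^2 u}{\partial x^2} - \frac{\partial u}{\partial t} = 0, \qquad 0 < x < 1; \quad 0 < t < T \tag{64a} $$

B.C.:

$$ u(0,t) = 1, \qquad \frac{\partial u}{\partial x}(1,t) = 0 \tag{64b, c} $$

I.C.:

$$ u(x,0) = 0 \tag{64d} $$

To facilitate computation, we transform the independent variable t into $\tau$
such that

$$ \tau = t/T \tag{65} $$

thus, the system of eqs. (64) becomes

D.E.:

$$ \frac{\partial^2 u}{\partial x^2} - \frac{1}{T} \frac{\partial u}{\partial \tau} = 0, \qquad 0 < x < 1; \quad 0 < \tau < 1 \tag{66a} $$

B.C.:

$$ u(0,\tau) = 1; \qquad \frac{\partial u}{\partial x}(1,\tau) = 0 \tag{66b, c} $$

I.C.:

$$ u(x,0) = 0 \tag{66d} $$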

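According to our unconstrained variational formulation, this system is
again replaced by the following:

$$ \frac{\partial^2 u}{\partial x^2} - \frac{1}{T} \frac{\partial u}{\partial \tau} = 0, \qquad 0 < x < 1; \quad 0 < \tau < 1 \tag{67a} $$

$$ \frac{\partial u}{\partial x}(0,\tau) + \alpha\,[u(0,\tau) - 1] = 0 \tag{67b} $$

$$ \frac{\partial u}{\partial x}(1,\tau) = 0 \tag{67c} $$

$$ u(x,0) = 0 \tag{67d} $$

which reduces to (66) as $\alpha \to \infty$. The variational statement
can be written as

$$ \delta J = 0 \tag{68a} $$

where

$$ J = -\int_0^1\!\!\int_0^1 \left[ \frac{\partial u}{\partial x} \frac{\partial v}{\partial x} + \frac{1}{T} \frac{\partial u}{\partial \tau}\,v \right] dx\,d\tau + \alpha \int_0^1 [u(0,\tau) - 1]\,v(0,\tau)\,d\tau + \int_0^1 u(x,0)\,v(x,0)\,dx \tag{68b} $$

Since $\delta v$ is unconstrained, it is a simple matter to show that

$$ (\delta J)_u = 0 \tag{69} $$


Figure 2. Finite Element Grid Scheme Used for a
Transient Heat Conduction Problem

TABLE 4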


Transient Heat Transfer Solutions u(x,t) Using FE-UVF
Compared with Exact Series Solutions (in Parentheses)

0 < t < T = 1.00


  t \ x    0               0.2             0.4             0.6             0.8             1.0

  0.2     1.000 (1.000)   0.754 (0.757)   0.583 (0.496)   0.370 (0.405)   0.264 (0.284)   0.228 (0.179)
  0.4     1.000 (1.000)   0.855 (0.853)   0.713 (0.721)   0.622 (0.616)   0.552 (0.549)   0.516 (0.526)
  0.6     1.000 (1.000)   0.910 (0.910)   0.828 (0.830)        -          0.725 (0.724)   0.708 (0.710)
  0.8     1.000 (1.000)   0.945 (0.945)   0.896 (0.896)   0.857 (0.857)   0.832 (0.832)   0.823 (0.823)
  1.0     1.000 (1.000)   0.967 (0.967)   0.937 (0.937)   0.913 (0.913)   0.897 (0.897)   0.892 (0.892)

TABLE 5


Transient Heat Transfer Solutions u(x,t) Using FE-UVF
Compared with Exact Series Solutions (in Parentheses)

0 < t < T = 0.05


  t \ x    0               0.2             0.4             0.6             0.8             1.0

  0.01    1.000 (1.000)   0.144 (0.157)   0.014 (0.005)   0.002 (0.000)   0.000 (0.000)   0.000 (0.000)
  0.02    1.000 (1.000)   0.315 (0.317)   0.047 (0.046)   0.003 (0.003)   0.000 (0.000)   0.000 (0.000)
  0.03    1.000 (1.000)   0.413 (0.414)   0.103 (0.102)   0.015 (0.014)   0.001 (0.001)   0.000 (0.000)
  0.04    1.000 (1.000)   0.479 (0.480)   0.157 (0.157)   0.034 (0.034)   0.005 (0.005)   0.001 (0.001)
  0.05    1.000 (1.000)   0.527 (0.527)   0.206 (0.206)   0.058 (0.058)   0.012 (0.012)   0.003 (0.003)

is a necessary and sufficient condition for eqs. (67). Now the finite
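element matrix equations can be obtained from eq. (69):

$$ (\delta J)_u = -\int_0^1\!\!\int_0^1 \left[ \frac{\partial u}{\partial x}\,\delta\!\left( \frac{\partial v}{\partial x} \right) + \frac{1}{T} \frac{\partial u}{\partial \tau}\,\delta v \right] dx\,d\tau + \alpha \int_0^1 [u(0,\tau) - 1]\,\delta v(0,\tau)\,d\tau + \int_0^1 u(x,0)\,\delta v(x,0)\,dx = 0 \tag{70} $$

or,

$$ -\int_0^1\!\!\int_0^1 \left[ \frac{\partial u}{\partial x}\,\delta\!\left( \frac{\partial v}{\partial x} \right) + \frac{1}{T} \frac{\partial u}{\partial \tau}\,\delta v \right] dx\,d\tau + \alpha \int_0^1 u(0,\tau)\,\delta v(0,\tau)\,d\tau + \int_0^1 u(x,0)\,\delta v(x,0)\,dx = \alpha \int_0^1 \delta v(0,\tau)\,d\tau \tag{71} $$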


Using the usual procedure of discretization and the assumption
of displacement functions, the final finite element matrix equation
can be derived from eq. (71). We shall omit the details here.
The computational results are presented in Tables 4 and 5. The finite
element grid scheme used is shown in Figure 2. As shown in
those tables, excellent agreement exists between the FE-UVF approach
and the series solution. It is noted that the approximate solutions
are invariably less accurate as they approach the initial time t = 0.
This is probably due to the discontinuity of the initial and boundary data
at x = 0, t = 0.

1. INTRODUCTION

This  paper  is  concerned  with  the  numerical  solution 
of  free  boundary  problems  by  mathematical  programming. 

In  such  problems,  one  seeks  the  solution  of  a  partial 
differential  equation  (usually  Laplace's  or  Reynolds' 
equation)  satisfying  prescribed  conditions  on  the 
boundary  of  a  region  when  a  portion  of  the  boundary  is 
unknown  and  must  be  determined  as  part  of  the  problem. 
The unknown boundary is called the free boundary.

Many  of  these  boundary  value  problems  have  not 
yielded  to  analytical  methods  of  solution.  Recently, 
however,  a  novel  transformational  approach  has  met 
with  more  success.  Specifically,  the  free  boundary 
problem  is  reformulated  as  a  variational  inequality 
which,  in  turn,  is  equivalent  to  a  certain  constrained 
minimization  problem  in  a  Sobolev  (function)  space. 
Although  this  latter  problem  is  still  computationally 
intractable,  finite  difference  or  finite  element 
approximations  yield  a  difficult,  but  solvable,  sparse, 
specially-structured  quadratic  programming  problem  of 
potentially  very  large  size.  It  is  the  solution  of 
this  last  problem  with  which  we  are  concerned  and  for 
which  an  algorithm  will  be  stated. 

2.  APPLICATIONS 


Free  boundary  problems  arise  in  a  variety  of 
situations.  Rohde  and  McAllister  [8]  have  developed 
the  variational  inequalities  for  the  finite-length 
journal  bearing  problem,  in  which  one  is  concerned 
with  a  cylindrical  rod  (the  journal)  rotating  within  a 
tube (the bearing). The inner surface of the bearing
is  coated  with  a  thin  film  of  lubricant  and  we  wish 
to  know  the  pressure  distribution  on  the  film.  At  a 
certain  point,  the  pressure  becomes  so  low  that  the 
lubricant vaporizes, thus creating the free boundary
interface.

In the area of fluid dynamics, Baiocchi et al. [1]
have reformulated certain problems dealing with
stationary fluid flow through porous media as variational
inequalities. These include porous dams in
which the free boundary is the interface between the
wet and dry part of the dam. Brezis and Stampacchia
[2], [3] have studied the determination of steady subsonic
flows for nonviscous compressible and incompressible
fluids past a two-dimensional convex body by
using a hodograph transformation to obtain an equivalent
free boundary problem for which a variational
inequality problem can be stated.

3.  THE  QUADRATIC  PROGRAMMING  PROBLEM 

The common denominator of these and several other
free boundary problems is that their associated
quadratic programming problem

$$\text{Minimize } f(x) = \tfrac{1}{2}\langle x, Mx\rangle + \langle q, x\rangle \quad \text{subject to } x \ge 0$$

has certain special attributes which can be exploited in
the development of efficient algorithms. The matrix M
is a block-tridiagonal Stieltjes matrix (i.e., symmetric,
diagonally dominant with nonpositive off-diagonal
entries). Furthermore, the diagonal blocks are themselves
tridiagonal matrices and the off-diagonal blocks
are diagonal matrices.

One computationally successful approach to this
problem is a modification of the block- (or line-)
successive overrelaxation method. This algorithm
requires that we partition the vector

$$x = (x_1, \ldots, x_m), \qquad x_i \in R^{n_i},$$

and conformably partition M and q. For this special class of problems,
we may state the algorithm as follows:

Algorithm 

Step 0. Let $x^0 = (x_1^0, x_2^0, \ldots, x_m^0)$ be any nonnegative
vector, e.g., $x^0 = 0$. Let $\omega \in (0,2)$ be given. Set k=0 and
i=1.

Step 1. Determine $\bar{x}_i^{k+1} \ge 0$ which minimizes (over the
nonnegative orthant of $R^{n_i}$)

$$f(x_1^{k+1}, \ldots, x_{i-1}^{k+1}, v, x_{i+1}^{k}, \ldots, x_m^{k}) = \tfrac{1}{2}\langle v, M_{ii} v\rangle + \Big\langle q_i + \sum_{j<i} M_{ij} x_j^{k+1} + \sum_{j>i} M_{ij} x_j^{k},\; v \Big\rangle + c_i^k$$

where $c_i^k$ may be taken to be zero.

Step 2. Define

$$\omega_i^{k+1} = \max\{\bar{\omega} : \bar{\omega} \le \omega,\; x_i^k + \bar{\omega}(\bar{x}_i^{k+1} - x_i^k) \ge 0\},$$

$$x_i^{k+1} = x_i^k + \omega_i^{k+1}(\bar{x}_i^{k+1} - x_i^k).$$

Step 3. If i=m, go to Step 4. Otherwise, return to Step 1
with i replaced by i+1.

Step 4. Define

$$S = \{(i,j) : (x_i^{k+1})_j > 0\} \cup \Big\{(i,j) : (x_i^{k+1})_j = 0,\; \Big(q_i + \sum_{l=1}^{m} M_{il} x_l^{k+1}\Big)_j < 0\Big\}.$$

If $\max_{(i,j) \in S} \big|\big(q_i + \sum_{l=1}^{m} M_{il} x_l^{k+1}\big)_j\big| < \varepsilon$, stop. An approximate
solution is at hand. If not, return to Step 1 with k
replaced by k+1 and i=1.

Step  1  requires  that  we  solve  a  smaller  quadratic 
programming  problem  whose  quadratic  form  contains  a 
tridiagonal  Stieltjes  matrix.  For  a  discussion  of  some 
fast  methods  to  do  this,  we  refer  the  reader  to  [5] .  For 
more  details  on  the  development  of  and  computational 
experience  with  the  algorithm  given  above,  see  [4] . 
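
The steps of the algorithm can be sketched in Python as follows. This is only an illustrative sketch, not the implementation reported in [4]: the function names, the default values of omega and eps, and the test problem are assumptions, and the block subproblem of Step 1 is solved here by a plain projected Gauss-Seidel iteration, which merely stands in for the fast tridiagonal methods of [5].

```python
import numpy as np

def solve_block(M_ii, r, v, sweeps=50):
    """Approximately minimize 0.5*<v, M_ii v> + <r, v> over v >= 0.

    Projected Gauss-Seidel is an assumed stand-in for the fast
    tridiagonal solvers referred to in the text.
    """
    v = v.copy()
    for _ in range(sweeps):
        for j in range(v.size):
            g = r[j] + M_ii[j] @ v - M_ii[j, j] * v[j]
            v[j] = max(0.0, -g / M_ii[j, j])
    return v

def modified_block_sor(M, q, blocks, omega=1.5, eps=1e-8, max_iter=500):
    """Steps 0-4 for: minimize 0.5*<x, Mx> + <q, x> subject to x >= 0."""
    x = np.zeros(q.size)                       # Step 0: x^0 = 0
    for _ in range(max_iter):
        for idx in blocks:                     # Steps 1-3: sweep over blocks
            M_ii = M[np.ix_(idx, idx)]
            # q_i plus the coupling to all other blocks (newest values)
            r = q[idx] + M[idx] @ x - M_ii @ x[idx]
            x_bar = solve_block(M_ii, r, x[idx])
            d = x_bar - x[idx]
            shrinking = d < 0
            # Step 2: largest factor <= omega that keeps the block nonnegative
            w = omega
            if shrinking.any():
                w = min(omega, np.min(-x[idx][shrinking] / d[shrinking]))
            x[idx] = np.maximum(x[idx] + w * d, 0.0)  # clip guards round-off
        g = M @ x + q                          # Step 4: gradient of f
        S = (x > 0) | ((x == 0) & (g < 0))
        if not S.any() or np.max(np.abs(g[S])) < eps:
            break
    return x

# Assumed test problem: five-point stencil on a 4 x 4 grid, so the diagonal
# blocks are tridiagonal and the off-diagonal blocks are -I.
n = 4
T = 2 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)
M = np.kron(np.eye(n), T) + np.kron(T, np.eye(n))
q = np.linspace(-1.0, 1.0, n * n)
blocks = [np.arange(i * n, (i + 1) * n) for i in range(n)]
x = modified_block_sor(M, q, blocks)
```

At termination x should be nonnegative, the gradient Mx + q should be (nearly) nonnegative, and the two should be complementary, which is what the stopping test of Step 4 measures.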

From a consideration of storage requirements and speed,
one may conclude that this algorithm is competitive with, if
not superior to, other methods described in the literature.

ABSTRACT. An error analysis is performed upon the superposition and numerical
integration procedures of a method of solution of multipoint boundary value problems
utilizing power series expansions. The procedure involves the evaluation of the relative
error of the Wronskian, which provides a scalar function characterization of
the error of integrators of a matrix of solutions. The error behavior is investigated
by using different integration step sizes and orders (terms of the power series).

Evaluations are performed with numerical solutions of specified accuracy or
order. Example applications are included.

1. INTRODUCTION. Numerical integration is commonly used by engineers and
technologists as a tool for solving ordinary differential equations. Those equations
which cannot be solved exactly or in closed form can often be solved using numerical
integration techniques. There are drawbacks to each particular integration scheme.

The most important considerations are: the origin of the problem, guidelines from
the theory of the algorithm, the computer being used, and the class of problems to
be considered, Shampine and Allen (1973).

Many techniques, in the form of "canned" routines or pre-programmed methods,
and their variations, are available to the user. It is now possible to obtain numerical
solutions using techniques which require lengthy operations. The more popular
integration techniques (e.g., Adams methods, Runge-Kutta, etc.) provide reasonable
results for a wide range of applications. They are subject to some disadvantages, the
most common being their susceptibility to round-off error, Ralston and Wilf (1960).

An alternate method of numerical integration has been investigated which is
based upon the expansion of power series. The method is relatively free of the
disadvantages of the more popular techniques and significantly more efficient for
certain classes of problems, Doiron (1967). Research done previously by Fehlberg
(1964) has shown the power series technique to be five to six times faster than a
Runge-Kutta method for the same specified accuracy in certain selected problems.

The power series methods generally require more user effort.

The purpose of this study is to investigate the error in the integration via
power series expansions. It has been shown that the Wronskian can be used as a
meaningful check on the solvability and superposition procedures in the solution
of boundary value problems. It has been proposed that the relative error of the
Wronskian can provide some insight into the errors arising from this particular
integration scheme, Childs et al. (1971).

2. DEVELOPMENT. The problems to be considered are presented as an ordinary
differential equation written in the general linear form as

$$\dot{y} = Ly + f \tag{2-1}$$

where L is a linear operator in the form of an n x n coefficient matrix (expressed
as a constant or function of an independent variable). The letter y represents the
state variable vector and $\dot{y}$ denotes the derivative of y with respect to the independent
variable (in this case t). The vector f is a vector of forcing functions. The
above equation is subject to a set of specified boundary conditions

$$q_i(y(t_i)) = b_i, \qquad 0 \le t_i \le T, \qquad i = 1, 2, \ldots, m \tag{2-2}$$

where $m \le n$. The operator $q_i$ is a linear combination of the elements of the vector y
at $t = t_i$ that is equal to the boundary value $b_i$.

To meet the above boundary conditions it is necessary to superimpose independent
solutions of equation (2-1). The technique used is to superimpose the appropriate
number of solutions of the homogeneous equation

$$\dot{H} = LH \tag{2-3}$$

upon a particular solution

$$\dot{p} = Lp + f. \tag{2-4}$$

This can be written as

$$y = p + H\beta = p + \sum_{k=1}^{r} h^{(k)} \beta_k, \qquad r \le n \tag{2-5}$$

where H is a matrix whose columns are homogeneous solutions. The superscript in
parentheses indicates that the vector is the (k)th column of the matrix denoted by the
capital letter, and $\beta$ denotes the superposition constants. The letter r denotes
the number of homogeneous equations, which is equal to the number of unknown elements
of y(0).

It is known from elementary differential equation techniques that the sum of
a particular solution of a linear differential equation and a solution of its homogeneous
differential equation is merely another particular solution of that differential
equation.

Utilizing this fact, it can be established that y can be expressed as a combination
of particular solutions (Childs, 1971)

$$y = P\alpha = \sum_{k=0}^{r} p^{(k)} \alpha_k \tag{2-6}$$

where P is a matrix whose columns are solutions of equation (2-1), thus;

$$\dot{p}^{(k)} = L p^{(k)} + f, \qquad k = 0, 1, \ldots, r. \tag{2-7}$$

We multiply each side of equation (2-7) by $\alpha_k$ and sum these products

$$\sum_{k=0}^{r} \dot{p}^{(k)} \alpha_k = L \sum_{k=0}^{r} p^{(k)} \alpha_k + f \sum_{k=0}^{r} \alpha_k. \tag{2-8}$$

By comparing equation (2-1) with equation (2-8), it is obvious that the left hand
side of equation (2-8) is the quantity $\dot{y}$ and the first term of the right hand side
is L applied to the state vector y. Therefore, the superposition constants
must obey

$$\sum_{k=0}^{r} \alpha_k = 1. \tag{2-9}$$

After determining the superposition constants, subject to the above restriction,
the solution becomes trivial and is generated utilizing the initial conditions

$$y(0) = P(0)\,\alpha. \tag{2-10}$$

The reason for superposition of solutions is to satisfy the boundary conditions.
It is necessary that the superimposed solutions be independent to be able to meet
boundary conditions.

The requirement of independence is checked using a determinant of homogeneous
solutions, which is usually known as the Wronskian. The independence of homogeneous
solutions is satisfied when the matrix whose columns are these vectors is of rank r,

$$\text{rank}(H) = r \tag{2-11}$$

that is, H must contain at least one r x r submatrix that has a non-zero determinant
for the range of values of the independent variable.

This can be applied to superposition of particular solutions. Define $\mathcal{P}$ as an
(n+1) x (r+1) matrix in which the first row elements are one (unity) and the remaining
submatrix is P (shown in equation 2-6),

$$\mathcal{P} = \begin{bmatrix} 1 & 1 & \cdots & 1 \\ & P & & \end{bmatrix}. \tag{2-12}$$

The Wronskian of rank n has been shown to obey the following equation,
Petrovski (1966):

$$\det(H(t)) = \det(H(0)) \exp\Big(\int_0^t \text{tr}(L(\xi))\, d\xi\Big) \tag{2-13}$$

where $\text{tr}(L(\xi))$ is the trace (the summation of the principal diagonal of the matrix)
of the coefficient matrix in equation (2-1). When the Wronskian is non-zero at the
initial value of the independent variable, then it is theoretically non-zero for all
values of the independent variable over any finite interval. The following theorem
adapts (2-13) to particular solutions.

Theorem:

$$\det(\mathcal{P}(t)) = \det(\mathcal{P}(0)) \exp\Big(\int_0^t \text{tr}(L(\xi))\, d\xi\Big) \tag{2-14}$$

Proof

The columns of $\mathcal{P}$ are $\begin{bmatrix} 1 \\ p^{(k)} \end{bmatrix}$ and

$$p^{(k)} = p^{(0)} + h^{(k)}, \qquad k = 1, \ldots, n. \tag{2-15}$$

The subtraction of one column of a matrix from all other columns does not affect
the value of the determinant of that matrix. Therefore, subtracting the 0th column of
$\mathcal{P}$ from all other columns and comparing with equation (2-13) completes the proof;

$$\det(\mathcal{P}) = \det \begin{bmatrix} 1 & 0 \\ p^{(0)} & H \end{bmatrix} = \det(H). \tag{2-16}$$

A non-zero (zero) Wronskian shows that the solutions are (are not) linearly independent and that a
fundamental set of solutions exists (does not exist).

The relative error of the Wronskian is defined as follows;

$$R(t) = \frac{|W(t)| - |W_n(t)|}{|W(t)|} \tag{2-17}$$

where $W_n(t)$ is evaluated utilizing particular solutions which come from numerical
integration procedures and $W(t)$ is evaluated from (2-14).
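
As an illustration of definition (2-17), the following Python sketch integrates a matrix of homogeneous solutions numerically and compares its determinant with the closed form (2-13). It is a sketch under stated assumptions, not the program used in the study: the integrator is a classical Runge-Kutta step rather than the power series method, the coefficient matrix is an assumed constant-coefficient damped oscillator, the parameter values are illustrative, and all function names are hypothetical.

```python
import numpy as np

def rk4_step(L, Y, h):
    """One classical Runge-Kutta step for the matrix equation Y' = L Y."""
    k1 = L @ Y
    k2 = L @ (Y + 0.5 * h * k1)
    k3 = L @ (Y + 0.5 * h * k2)
    k4 = L @ (Y + h * k3)
    return Y + (h / 6.0) * (k1 + 2.0 * k2 + 2.0 * k3 + k4)

def wronskian_relative_error(L, H0, t_end, n_steps):
    """R(t_end) of (2-17) for a constant coefficient matrix L."""
    h = t_end / n_steps
    H = H0.copy()
    for _ in range(n_steps):
        H = rk4_step(L, H, h)
    w_numeric = np.linalg.det(H)
    # (2-13) with constant L: the integral of the trace is tr(L) * t
    w_exact = np.linalg.det(H0) * np.exp(np.trace(L) * t_end)
    return (abs(w_exact) - abs(w_numeric)) / abs(w_exact)

# Assumed example system: y1' = y2, y2' = -lam*y1 - rho*y2
lam, rho = 3.0, 0.2
L = np.array([[0.0, 1.0], [-lam, -rho]])
H0 = np.eye(2)          # independent initial conditions
for n_steps in (50, 100, 200):
    print(1.0 / n_steps, wronskian_relative_error(L, H0, 1.0, n_steps))
```

Halving the step size should reduce the magnitude of R by roughly a factor of 16 for this fourth-order integrator, until round-off takes over.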

3. AN EXAMPLE. The power series integration method was programmed in FORTRAN
utilizing an Amdahl 470 digital computer. The program, subroutines and function
routines used in the study were provided from unpublished studies, Childs (1975).

The results are for damped, forced harmonic oscillators described by the
following equations;

$$\dot{y}_1 = y_2$$
$$\dot{y}_2 = [\ldots]$$

A set of independent particular solutions is created using arbitrarily chosen
initial conditions.

Power series evaluations are then generated using these initial conditions and
the recursive relationships. The set of particular solutions is then solved over
the range of the independent variable, t.

By calculating the Wronskian using both numerical procedures and the analytical
method, the relative error may be examined.

The results of the relative error of the power series integration procedure
are compared to results obtained from previous studies by Childs et al. (1971) concerning
the same problem using two different numerical integration procedures with
$\lambda = 3$ and $\rho = 0.2$. The two integration procedures used to compare with the power
series method are modified Euler and Runge-Kutta methods. They are of order $h^2$ and
$h^4$ respectively, where h is the integration step size. The two plots in Figure 4-1
are log-log plots of R(t) versus h for the Euler and Runge-Kutta procedures. For
these results it has been observed that the following relationship is true:

$$\gamma\,(R(t)) = R(\gamma t)$$

where $\gamma$ is a positive scalar quantity. From these results it has also been suggested
that the relative error is dominated by the following proportionality for "reasonable"
integration step sizes

$$|R(t)| \propto t\,h^{j}$$

where j is the order of the integration formula used.

Comparing both cases (Figure 4-1.a and 4-1.b) it has been determined that
they have slopes of 2 and 4 respectively. It has also been observed that for "large"
step sizes the points tend away from the straight line due to approximation error, and
also for "small" step sizes due to round-off error.

Results for the power series integration procedure are presented in Figure 4-2
in the form of a log-log plot of R(t) versus h for different orders (terms in the
power series). It is seen that a family of curves exists for different orders. It
was observed that for constant step sizes the error decreases as the number of terms
increases. As the step size increases, the number of terms must also increase in
order to retain a specified accuracy. As with the Euler and Runge-Kutta procedures, the
relative error tends toward linearity as it increases with step size. As the step
size decreases for each "order curve" the error function tends toward the error
specification. This observation can be explained by the evaluation subroutine's use
of an accuracy specification of $1 \times 10^{[\ldots]}$. Thus, more accuracy was not attempted.
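
The dependence of the error on the number of terms can be illustrated with a small Python sketch of a fixed-order power series (Taylor) integrator for a constant-coefficient linear system. It is a simplified sketch, not the FORTRAN routines of the study: it has no accuracy specification or automatic term selection, the coefficient matrix and parameter values are assumptions, and the function names are hypothetical.

```python
import numpy as np

def taylor_step(L, Y, h, terms):
    """Advance Y' = L Y by h, keeping `terms` series terms beyond Y itself."""
    term = Y.copy()
    total = Y.copy()
    for k in range(1, terms + 1):
        term = (h / k) * (L @ term)   # recursion for L^k Y h^k / k!
        total = total + term
    return total

def series_relative_error(L, H0, t_end, n_steps, terms):
    """Relative error (2-17) of the Wronskian for the series integrator."""
    h = t_end / n_steps
    H = H0.copy()
    for _ in range(n_steps):
        H = taylor_step(L, H, h, terms)
    w_exact = np.linalg.det(H0) * np.exp(np.trace(L) * t_end)
    return (abs(w_exact) - abs(np.linalg.det(H))) / abs(w_exact)

# Assumed system y1' = y2, y2' = -3*y1 - 0.2*y2, constant step size h = 0.05
L_demo = np.array([[0.0, 1.0], [-3.0, -0.2]])
for terms in (2, 4, 6):
    print(terms, series_relative_error(L_demo, np.eye(2), 1.0, 20, terms))
```

For a fixed step size, the magnitude of the printed error should fall as terms are added, which is the behavior described for the family of "order curves".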


Since the curves are all similar, results will be explained for only one curve.
For the "order curve" evaluated at 4 terms, the step size begins at .001. It is
observed to be within accuracy specifications due to the fact that the power series
integration is performed with such small step sizes. Since such small steps are
used, all terms (in this case, 4) are not required to meet the accuracy. As the
step size increases, more terms are required to meet the accuracy specification. At
the step size .005, it is observed that the curve "dips". This occurs because at
this step size more terms (in this case, 1) are required to meet the accuracy. From
this point on the routine is utilizing all the terms of the power series in order
to meet the accuracy requirement. However, as the step size increases, it is seen
that the accuracy is not being met due to the larger steps being taken. It is also
seen that the error function is linear while all terms of the power series are being
used and would continue to be linear (within machine limitations) if it were not
for round-off.

Results were also calculated for several values of $(\lambda, \rho)$. All tendencies held
as shown in Figure 4-2.

4. CONCLUSIONS. The relative error of the Wronskian can apparently be used
to determine if the step size used by an integration procedure is appropriate. The
error would grow approximately linearly in a log-log plot. Further investigations
should involve different systems of equations.


INPUT CONTROLLABLE STOCHASTIC MODEL


Sheafen  Frank  Kuo 

U. S. Army Construction Engineering Research Laboratory

P. O. Box 4005
Champaign,  Illinois  61820 


1. INTRODUCTION. This paper introduces a model which incorporates
the principles of both Markov chains and finite state machines. Markov
chains possess stochastic behavior in the transition between states but
are not input controllable. Finite state machines, on the other hand,
are input controllable between states, but do not have stochastic
behavior. Basic concepts of an input controllable stochastic model and
analysis of its short- and long-term behaviors are presented. Forecast
accuracy (FA) of a model is defined and relations between strings and
models are described. The first order derivative (FOD) of a model is
introduced. A sufficient condition for a model and its FOD to have
equal FA is proved. In addition, some applications are briefly discussed.


2.  INPUT  CONTROLLABLE  STOCHASTIC  MODEL. 

A. Definition. An Input Controllable Stochastic Model (ICSM) is a
quadruple H = {I, O, S, p} where I is the input set, S is the state set,
O is the output set, and p is a probabilistic function, such that

$$p : I \times S_1 \times O \times S_2 \to P$$

where

$S_1, S_2 \subseteq S$

P = the set of real numbers between 0 and 1.

In other words, given input $x_i$ and present state $S_j$, p assigns a
probability to each output $y_m$ and next state $S_n$. Using the property
of the probability function gives

$$\sum_{y_m \in O} \sum_{S_n \in S} p(x_i, S_j, y_m, S_n) = 1, \quad \text{for all } x_i \in I,\ S_j \in S.$$


B. Example. Let I = O = {x, y}, S = {A, B, C}. p is defined as
follows:

p(x, A, x, A) = 1/4
p(x, A, y, B) = 1/4
p(x, A, x, C) = 1/2
p(y, A, y, C) = 1
p(x, B, x, A) = 1/6
p(x, B, y, A) = 1/6
p(x, B, x, C) = 2/3
p(y, B, x, B) = 1
p(x, C, y, A) = 1/2
p(x, C, y, B) = 1/2
p(y, C, x, C) = 1

p = 0 otherwise
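
The example can be checked mechanically. The short Python sketch below stores the nonzero values of p in a dictionary keyed by (input, present state, output, next state) and verifies the normalization property stated in the definition. The dictionary layout and the function name are assumptions made for illustration only.

```python
from collections import defaultdict
from fractions import Fraction as F

# Nonzero values of p(input, present state, output, next state)
p = {
    ("x", "A", "x", "A"): F(1, 4),
    ("x", "A", "y", "B"): F(1, 4),
    ("x", "A", "x", "C"): F(1, 2),
    ("y", "A", "y", "C"): F(1),
    ("x", "B", "x", "A"): F(1, 6),
    ("x", "B", "y", "A"): F(1, 6),
    ("x", "B", "x", "C"): F(2, 3),
    ("y", "B", "x", "B"): F(1),
    ("x", "C", "y", "A"): F(1, 2),
    ("x", "C", "y", "B"): F(1, 2),
    ("y", "C", "x", "C"): F(1),
}

def totals_by_input_and_state(p):
    """Sum p over all outputs and next states for each (input, state) pair."""
    totals = defaultdict(F)
    for (inp, state, _out, _nxt), prob in p.items():
        totals[(inp, state)] += prob
    return dict(totals)

totals = totals_by_input_and_state(p)
```

Every one of the six (input, state) pairs should total exactly 1.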

C. Graphic Notation. Noting the input, output, and probability
on an arc path between two states $S_i$ and $S_j$ gives the following:

[Figure: an arc from $S_i$ to $S_j$ labeled "x/y, 1/2"]

This notation means that given input x and current state $S_i$, the
probability of getting the next state $S_j$ and output y is 1/2. Making an
arc between each pair of communicable states would give a flow graph for that
model. The flow graph of the last example is shown as follows:

[Figure: flow graph of the preceding example; legible arc labels include "x/y, 1/6", "x/x, 1/6", and "y/x, 1"]

3. INPUTTABLE MARKOV CHAIN (IMC). A special case of ICSM of interest
in this paper is the model with empty output set O. This kind of
model is called the Inputtable Markov Chain (IMC).

A. Definition. An IMC is a triple G = {I, S, k} where I is the
input set, S is the state set, and k is a probability function which
satisfies:

$$k(S_i, x_j, S_k) = \text{prob}\{S(t+1) = S_k \mid S(t) = S_i,\ x_j \text{ is input}\} \quad \text{for all } S_i, S_k \in S \text{ and } x_j \in I.$$

Define

$$P_{x_j} = [\,k(S_i, x_j, S_k)\,] \tag{1}$$

as a conditional transition matrix of input $x_j$. Notice that the summation
of each row is 1.

Suppose at each state $S_i$ the probability of getting input $x_j$ is $q_{ij}$.
Let

$$Q_j = \text{diag}(q_{1j}, q_{2j}, \ldots, q_{mj}) \tag{2}$$

be a diagonal matrix, and let $P_j = Q_j P_{x_j}$.

Let

$$P = \sum_{j=1}^{n} P_j. \tag{3}$$

Thus, P is a transition matrix that does not require knowledge of the input variable.

4. SHORT- AND LONG-TERM BEHAVIOR.

A. Short-Term Behavior. From a given model one can explore the
k-step distribution; i.e., after the k-th step, the model will go to
a certain state with a certain probability. Two cases can be considered:

(1) Input string is given. If $x_1 x_2 \ldots x_k$ is the input string,
then the k-step conditional transition matrix is

$$P_{x_1 x_2 \ldots x_k} = P_{x_1} \cdot P_{x_2} \cdots P_{x_k} \tag{4}$$

where $P_{x_j}$ is the conditional transition matrix defined by equation 1.

(2) Input string is not given, but the input distribution matrix
$Q_j$ is given. Equations 2 and 3 can then be used to find P, and the
k-step transition matrix is as shown in equation 5.

$$P^k = \underbrace{P \cdot P \cdots P}_{k \text{ times}} \tag{5}$$
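
Both cases can be sketched in a few lines of Python. The two-state, two-input model used here is hypothetical; its numbers are assumptions chosen only to exercise the formulas, and the function names are illustrative.

```python
import numpy as np

# Hypothetical IMC with states (A, B) and inputs (a, b); values are assumed.
P_x = {
    "a": np.array([[1.0, 0.0], [1.0, 0.0]]),   # conditional matrix for input a
    "b": np.array([[0.5, 0.5], [0.0, 1.0]]),   # conditional matrix for input b
}
# q[x][i] = probability of receiving input x while in state i
q = {"a": np.array([0.5, 0.5]), "b": np.array([0.5, 0.5])}

def k_step_given_inputs(P_x, inputs):
    """Case (1): product of the conditional matrices along the input string."""
    n_states = next(iter(P_x.values())).shape[0]
    result = np.eye(n_states)
    for x in inputs:
        result = result @ P_x[x]
    return result

def unconditional_matrix(P_x, q):
    """Case (2): weight each conditional matrix by its input probabilities."""
    return sum(np.diag(q[x]) @ P_x[x] for x in P_x)

P = unconditional_matrix(P_x, q)
P_ab = k_step_given_inputs(P_x, "ab")
P_3 = np.linalg.matrix_power(P, 3)     # three-step matrix, input unknown
```

Each row of P, P_ab, and P_3 should sum to 1, since all are transition matrices.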

B. Long-Term Behavior. For long-term analysis, only case (2) is
considered here. If k is large, calculating $P^k$ is
cumbersome, but applying the z-transformation, which is a common
way of calculating the power of a stochastic matrix, simplifies it.
Let

$$q(k) = P^k.$$

The z-transformation Q(z) of q(k) is defined as:

$$Q(z) = \sum_{k=0}^{\infty} q(k)\, z^{-k}. \tag{6}$$

Since $q(k+1) = q(k) P$, it follows that $Q(z) = q(0) + z^{-1} Q(z) P$,

or

$$Q(z)\,[I - z^{-1}P] = q(0) \tag{7}$$

since

$$q(0) = I.$$

Hence

$$Q(z) = [I - z^{-1}P]^{-1}. \tag{8}$$

Let the inverse transform of Q(z) = y(k).

Therefore

$$y(k) = Z^{-1}\big[(I - z^{-1}P)^{-1}\big] \tag{9}$$

$$= q(k) \tag{10}$$

or

$$y(k) = P^k. \tag{11}$$

From equation 11,

$$\lim_{k \to \infty} P^k = \lim_{k \to \infty} y(k).$$
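
Equations 6 and 8 can be compared numerically. The Python sketch below sums the series of equation 6 for a hypothetical transition matrix (its entries are assumptions) and checks it against the closed form of equation 8 at a point with |z| > 1; it also evaluates a high power of P as a stand-in for the limit. The function names are illustrative.

```python
import numpy as np

def z_transform_partial_sum(P, z, n_terms):
    """Truncated series sum_{k < n_terms} P^k z^(-k), as in equation 6."""
    total = np.zeros_like(P)
    power = np.eye(P.shape[0])
    for k in range(n_terms):
        total = total + power * z ** (-k)
        power = power @ P
    return total

def z_transform_closed_form(P, z):
    """[I - z^(-1) P]^(-1), as in equation 8."""
    return np.linalg.inv(np.eye(P.shape[0]) - P / z)

P_demo = np.array([[0.75, 0.25], [0.5, 0.5]])   # assumed transition matrix
series = z_transform_partial_sum(P_demo, 2.0, 200)
closed = z_transform_closed_form(P_demo, 2.0)
limit = np.linalg.matrix_power(P_demo, 100)      # approximates lim P^k
```

For this matrix every row of the limit should approach (2/3, 1/3), the long-run state probabilities.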


5. STATE PROBABILITY AND FORECAST ACCURACY OF A MODEL.

A. Definition. State probability (or state frequency) of state
$S_i$ is defined as

$$P_i = \sum_j \sum_k P_j\, q_{jk}\, P_{jki} \tag{12}$$

where

$$\sum_k q_{jk} = 1 \tag{13}$$

$q_{jk}$ = the probability of input $x_k$ at state $S_j$

$P_{jki}$ = the conditional probability of $S_j$ transferring to
$S_i$, given input $x_k$.

Obviously

$$\sum_i P_i = 1. \tag{14}$$

Using equations 12 through 14, $P_i$ for each i can be found.

The forecast accuracy [FA(R)] of a model R is defined as:

$$FA(R) = \sum_i \sum_k P_i\, q_{ik} \max_j P_{ikj}.$$

Intuitively, FA(R) is the maximum average probability of forecasting
the next state correctly, given the current input and state.
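
A minimal Python sketch of these definitions follows. The state probabilities are obtained by solving the stationarity and normalization equations in the least-squares sense, which is one convenient choice and not necessarily the author's procedure; the array layout, the function names, and the two small test models are assumptions.

```python
import numpy as np

def state_probabilities(P):
    """Solve pi = pi P together with sum(pi) = 1 (least squares)."""
    m = P.shape[0]
    A = np.vstack([P.T - np.eye(m), np.ones((1, m))])
    b = np.zeros(m + 1)
    b[-1] = 1.0
    return np.linalg.lstsq(A, b, rcond=None)[0]

def forecast_accuracy(P_x, q):
    """FA = sum_i sum_k P_i q_ik max_j P_ikj.

    P_x[k][i, j] is P_ikj and q[i, k] is q_ik.
    """
    n_inputs = len(P_x)
    P = sum(np.diag(q[:, k]) @ P_x[k] for k in range(n_inputs))
    pi = state_probabilities(P)
    return sum(pi[i] * q[i, k] * P_x[k][i].max()
               for i in range(P.shape[0]) for k in range(n_inputs))

q_demo = np.full((2, 2), 0.5)   # two equally probable inputs at each state
# Hypothetical deterministic model: each input fixes the next state.
deterministic = [np.array([[1.0, 0.0], [1.0, 0.0]]),
                 np.array([[0.0, 1.0], [0.0, 1.0]])]
# Hypothetical model in which the next state is a fair coin toss.
coin = [np.full((2, 2), 0.5), np.full((2, 2), 0.5)]
fa_det = forecast_accuracy(deterministic, q_demo)
fa_coin = forecast_accuracy(coin, q_demo)
```

The deterministic model should give a forecast accuracy of 1 and the coin-toss model 1/2, the two extremes for a two-state model.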

B. Example 1.

[Figure: flow graph of the model R; legible arc labels: "b, 1/2" (twice)]

The model R is a simple model with two states, A and B, and two
inputs, a and b, which are equally probable at each
state. Simple calculation using equations 10, 11, and 12 gives:

state frequency: $P_A = 2/3$, $P_B = 1/3$

Therefore, state A is visited twice as frequently as state B is visited.

$$FA(R) = P_A\, q_{Aa} \max\{P_{AaA}, P_{AaB}\} + P_A\, q_{Ab} \max\{P_{AbA}, P_{AbB}\} + P_B\, q_{Ba} \max\{P_{BaA}, P_{BaB}\} + P_B\, q_{Bb} \max\{P_{BbA}, P_{BbB}\}$$

Since

$$q_{Aa} = q_{Ab} = q_{Ba} = q_{Bb} = \tfrac{1}{2}, \qquad P_A = \tfrac{2}{3}, \qquad P_B = \tfrac{1}{3}$$

Hence

$$FA(R) = \tfrac{1}{3} \max\{1, 0\} + \tfrac{1}{3} \max\{P_{AbA}, P_{AbB}\} + \tfrac{1}{6} \max\{1, 0\} + \tfrac{1}{6} \max\{1, 0\}$$

Therefore

FA(R) = 11/12.

The average chance of forecasting the next state correctly is 11/12,
given the current state and input.

It is trivial to see that a deterministic model, like a finite
state machine, has a forecast accuracy of 1.

The following are some of the trivial properties of FA(R):

(1) $FA(R) \ge \max_j \sum_i \sum_k P_i\, q_{ik}\, P_{ikj}$.

(2) If the number of states is 2, $FA(R) = 1 - \sum_i \sum_k P_i\, q_{ik} \min_j P_{ikj}$.

(3) Define $FA(R|k) = \sum_i P_i \max_j P_{ikj}$.

Then $\min_k FA(R|k) \le FA(R) \le \max_k FA(R|k)$.

6. THE STRING AND MODEL. Consider the following string:

aAaAaAbAbBbBaAbAbBaAbBaAbB ...  (15)

where A and B are state variables and a and b are input variables.

After sufficient observation, a model like that shown below can be
developed.

[Figure: flow graph of the observed model; legible arc labels: "b, 1/2" and "a, 1"]

Combining the last state with the current state, or putting the current
state to the left upper corner of the next state gives

 AAAABBAABABAB
aAaAaAbAbBbBaAbAbBaAbBaAbB ...

Putting the upper characters down gives

aAaAAaAAbAAbABbBBaBAbAAbABaBAbABaBAaABbB  (16)
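
The construction just described can be sketched in Python. The function name is hypothetical, and the sketch assumes that the string strictly alternates one input character with one state character, as string 15 does.

```python
def first_order_derivative(s):
    """Prefix every state after the first with the state that preceded it.

    `s` alternates input and state characters, e.g. "aAbAbB".
    """
    inputs = s[0::2]
    states = s[1::2]
    pieces = [inputs[0] + states[0]]          # first pair has no predecessor
    for i in range(1, len(states)):
        pieces.append(inputs[i] + states[i - 1] + states[i])
    return "".join(pieces)

print(first_order_derivative("aAaAaAbAbBbB"))
```

Applied to the first twelve characters of string 15, the function reproduces the opening of string 16, aAaAAaAAbAAbABbBB.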


String 16 is said to be a First Order Derivative (FOD) of string 15.
FODs are developed to increase the number of states so that the system
is better described. For example, suppose string 15 is an observer's weather
record with A and B meaning sunny and rainy, respectively, and a and b
meaning decreasing temperature and increasing humidity, respectively.
An FOD of String 15 can be derived to String 16 with AA as sunny, AB as
cloudy, BA as partly cloudy, and BB as rainy. Therefore, String 16 is
more descriptive than String 15.

7. THE DERIVATIVE OF THE MODEL.

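A. Definition. Like a string, the model also has derivatives.
Let R be an IMC.

$$R = \{I, S, H\}, \qquad I = \{x_1, x_2, \ldots, x_n\},$$

$$S = \{S_1, S_2, \ldots, S_m\}, \qquad H = \{P_{x_j} \mid j = 1, \ldots, n\}$$

where $P_{x_j}$ is a transition m-matrix under input $x_j$.

Define

$$S' = S \times S = \{S'_r \mid S'_r = S_i S_j;\ S_i, S_j \in S\}.$$

Let $P_{ikj}$ be an element of $P_{x_k}$ and $P'_{i_0 k j_0}$ be an element of $P'_{x_k}$, where
$P'_{x_k}$ is a transition matrix of dimension $m^2 \times m^2$ under $x_k$

such that

$$P'_{i_0 k j_0} = P_{ikj} \quad \text{where } S'_{i_0} = S_l S_i,\ S'_{j_0} = S_r S_j$$

and if i = r; otherwise $P'_{i_0 k j_0} = 0$.

Define

$$H' = \{P'_{x_k} \mid k = 1, \ldots, n\} \quad \text{and} \quad I' = I.$$

Then R' = {I', S', H'} is an FOD of R = {I, S, H}.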


B. Example 2. The FOD R' of the model R in Example 1 is as shown
below.

Similar to Example 1, assume that inputs a and b are equally probable
at each state. Then it can be shown that

$$FA(R) = FA(R') \tag{17}$$

In general, equation 17 is not true. However, a sufficient condition
for it can be found.

Define index sets of R, R' as follows:

$$T = \{r \mid S_r \in S\}$$

$$T' = \{r \mid S'_r \in S'\}$$

$$T'_i = \{r \mid S'_r \in S \times S_i\} \quad \text{where } S \times S_i = \{S_j S_i \mid j \in T\}$$

LEMMA 1:

(1) For all $i_0 \in T'_i$, $j \in T$ there exists $j_0 \in T'$ s.t. $P'_{i_0 k j_0} = P_{ikj}$.  (18)

(2) For all $i_0 \in T'_i$, $j_0 \in T'$ there exists $j \in T$ s.t. $P_{ikj} \ge P'_{i_0 k j_0}$.  (19)

Hence, for all $i_0 \in T'_i$, $\max_{j_0} P'_{i_0 k j_0} = \max_j P_{ikj}$.  (20)

QED

THEOREM:

If $q'_{i_0 k} = q_{ik}$ for all $i_0 \in T'_i$ and if $\sum_{i_0 \in T'_i} P'_{i_0} = P_i$ for all i,

Then

$$FA(R) = FA(R').$$

Proof:

Since

$$FA(R) = \sum_i \sum_k P_i\, q_{ik} \max_j P_{ikj},$$

$$FA(R') = \sum_{i_0} \sum_k P'_{i_0}\, q'_{i_0 k} \max_{j_0} P'_{i_0 k j_0}$$

$$= \sum_i \sum_k \sum_{i_0 \in T'_i} P'_{i_0}\, q'_{i_0 k} \max_{j_0} P'_{i_0 k j_0}$$

$$= \sum_i \sum_k \sum_{i_0 \in T'_i} P'_{i_0}\, q_{ik} \max_j P_{ikj}$$

$$= \sum_i \sum_k \big[(q_{ik} \max_j P_{ikj})\, P_i\big].$$

Hence FA(R) = FA(R'). QED

For most models R and their FODs R', the conditions

$$q'_{i_0 k} = q_{ik}$$

and $\sum_{i_0 \in T'_i} P'_{i_0} = P_i$ for all i

are easily satisfied; thus, the forecast accuracy of the FOD R' does not
differ from that of R.


8. APPLICATION. The Markov Chain has been applied to many management
[...] problems. The IMC improves the Markov Chain
by adding features to adapt it to the real world of physical, economic,
[...] systems [1], [4], [6]. The most important feature, input
controllability, allows one to understand a system
[...] subsequent changes in state (and output).
When the processing is stochastic, the finite state machine
(or automaton) cannot describe the procedure properly. If a model can be
built corresponding to a string of data, the model can then be tested
and evaluated by calculating its forecast accuracy. The FOD is a useful
tool in understanding the model, as illustrated by the weather forecasting
example.

Some stochastic automata have already been applied to the reliability
problem and decision process [8]. It is hoped that this
paper will create a new interest in the research in a discrete system.

A SCANNING ELECTRON MICROSCOPE INVESTIGATION
OF STATICALLY LOADED FOUNDATION MATERIALS

Champaign, Illinois 61820


ABSTRACT. Selected rock samples were tested to failure in bending
tension and compression test modes within the vacuum stage of a scanning
electron microscope (SEM). The load was applied slowly such that
crack initiation and growth could be observed and recorded by photography
and video tape. The failure surfaces were further evaluated by
standard methods to determine failure mechanisms involved for each test
mode and rock type.

1. INTRODUCTION. An understanding of the physical properties
and behavior of rock materials (rock engineering) is necessary to implement
a systems approach for designing a structure. Structural
design considerations may include rock removal, tunneling, use of
rock as a foundation material, or any combination of these factors.
Information about the fundamental mechanisms of the fatigue and failure
properties of rock is essential and should be available to the
design engineer. Since construction of underground structures such
as tunnels for defense facilities, underground power plants, and hydraulic
structures has increased, and since idealized construction
sites are not always available, it is essential that rock failure
mechanisms be controlled by proper design practice.

There have been few investigations concerning the failure modes of
rock materials in simulated field tests, primarily because of the experimental
problems associated with controlling rock failure. Wawersik,
Brace, and Fairhurst (AROD Proposal 11278 EN) have investigated the
post-failure behavior of selected materials. Brace has investigated
the microcavities in crystalline rocks; and Brace (ARO Contract
DAHCO 4-73-C-0017) is presently investigating the microstructure in
crystalline rocks with a scanning electron microscope. The study herein
complements these and other investigations by advancing the state-of-the-art
of failure mechanisms.

2. EQUIPMENT. The AMR 900 Scanning Electron Microscope is
a high-resolution instrument providing surface resolution of up to
200 Å and useful magnification of up to 50,000X. The depth of focus
is accurate to tens of microns. This means that a fairly rough surface,
such as a rock fracture surface, will remain in focus at high magnifications.
The micrograph obtained appears similar to that obtained
from the reflection light microscope, but it has much better resolution
and depth of field.

A stage and chamber door assembly is inserted in the AMR 900
to replace the standard door and stage. Depending on the
test mode, either the bending or the tension-compression device
is mounted on this assembly during its operation. A platform mounted on
the base plate provides the X motion (right and left), the Y motion (back
and forth) [...] is actuated
by shafts to the door and a single knob on the front face of
the door. One revolution of the exterior knob represents a change in Z
[...] digit corresponds to a change in
specimen height of 0.1 mm. A counter-clockwise rotation of the stage
raises the bending stage and closes the compression-tension heads. The
same revolution and motion changes apply to the X and Y directions.

The bending stage (Figure 1) is custom-designed to load a rectangular
specimen having maximum dimensions of 1 x 1 x 6 in. in simple three-point
bending to a maximum load of 2000 lb. This stage is essentially a
platform having a knife edge on its top surface that supports the
specimen at the center of its bottom surface. A load bar connected to the
platform by a ball screw and gear system is connected to the edge which
[...] specimen. The points of the specimen's tension links continually
vary to accommodate specimens of 3 to 5 in. in length. The maximum bar
deflection is 0.375 in.; it is applied via the hand crank on the outside
of the chamber door. Each digit of the readout corresponds to a specimen
deflection of 0.004 mm at no load.
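
The readout calibration quoted above can be turned into a small conversion aid. The sketch below is only illustrative: the function and constant names are hypothetical, and it assumes the no-load figure of 0.004 mm per digit holds across the whole travel, which the text does not claim for a loaded specimen.

```python
MM_PER_INCH = 25.4
MM_PER_DIGIT_BENDING = 0.004  # from the text: bending-stage readout, at no load


def digits_to_deflection_mm(digits, mm_per_digit=MM_PER_DIGIT_BENDING):
    """Convert a readout count to a nominal specimen deflection in mm."""
    return digits * mm_per_digit


def deflection_in_to_digits(deflection_in, mm_per_digit=MM_PER_DIGIT_BENDING):
    """Convert a deflection in inches to the equivalent readout count."""
    return deflection_in * MM_PER_INCH / mm_per_digit


# Readout span corresponding to the 0.375-in. maximum bar deflection.
max_digits = deflection_in_to_digits(0.375)
print(f"{max_digits:.2f} digits for 0.375 in.")
print(f"{digits_to_deflection_mm(250):.3f} mm for 250 digits")
```

Under load the compliance of the gear train and load bar would make the true specimen deflection smaller than this nominal value, so the conversion is best read as an upper bound.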


The  bending  device  is  loaded  into  the  chamber  parallel  to  the  Y 
axis  at  an  angle  of  45  degrees  to  the  horizontal.  Two  positions  180 
degrees  apart  are  possible,  allowing  observation  of  the  tension  face  or 
a  side  face  of  the  specimen. 


The tension-compression stage (Figure 2) consists of two heads
mounted on a pair of right- and left-hand ball screws. When the screws
are rotated, the heads move either together or apart, but remain
parallel. Compression specimens are placed between the flat surfaces of
the heads for testing. Tension specimens may be held in place by various
techniques. In this study, square steel heads with a centered
slot and a pin hole normal to the slot were epoxied to the specimen ends.
These in turn were connected by the pin to threaded rods, flattened at
one end, which fed through the holes in the stage heads (Figure 3).


The gear train used for specimen deflection is the same as that used for
the bending stage; however, one digit of readout corresponds to a 0.005 mm
change in distance between the heads. Minimum distance between the heads
is 0.25 in., and maximum distance is 4.0 in. The maximum load which may
be applied in either failure mode (tension or compression) is 2000 lb. The
stage itself is tilted at an angle of 15 degrees to the horizontal;
however, since a specimen may be placed in any orientation between the
heads, any desired tilt may be obtained.

3. MONITORING DEVICES. A secondary (backscatter) electron image for
direct observation of the specimen is developed and displayed on a signal
modulation unit. (This is the primary visual means of specimen
observation.)

In addition, the secondary image may be displayed on a 9-in. square
TV rate monitor display unit. This unit displays the same field as the
previous module, but has a limited magnification range of from 100X to
10,000X. It has a built-in zoom capability that allows closeup of
a small area in the center of the TV and is operable at all magnifications.

Photomicrographs are obtained through a record oscilloscope 4 x 5 in.
square and Polaroid 52-P/55-P/N, 4 x 5 in. film. An alphanumeric generator
is integrated into the signal modular display unit in order to facilitate
identification and description of the photomicrographs.

4.  SPECIMEN  PREPARATION.  Table  1  lists  the  representative  suite  of 
rock  samples  chosen  for  evaluation  in  this  study  and  summarizes  their 
physical  characteristics.  One  set  of  specimens  was  prepared  for  each  of 
three  test  modes:  bending,  tension,  and  compression.  In  addition,  three 
cross-sectional  dimensions  were  prepared  to  determine  any  specimen  size 
effects. 

Bending (flexure) specimens were sawed into beams and ground square
in lengths from 4 to 5 in. and cross sections of 1/8, 1/4, and 1/2 in.
square. A fine notch was filed into the top (tension) surface to control
crack origin during scanning at high magnifications. This notch was
approximately 1/16 in. deep for all specimens.

Tensile specimens were prepared in the same manner, but were cut 2-1/8
in. in length. Notches were ground into opposite sides of the specimens to
minimize extraneous stress concentrations at other points in the specimen.
The intact cross-section varied from 3/16 to 1/4 in. depending
on the specimen size. Only the tensile specimens were modified for
testing; a steel head was epoxied to each end to facilitate application
of pure tensile stresses.

Compression specimens were prepared similarly to the tensile specimens
in lengths of 2 to 3 in., with no notching or other preparations
made after grinding.

All specimens were strain-gaged, coated under vacuum with gold-platinum
to facilitate conductivity, and wrapped in aluminum foil. The
purpose of the foil wrapping was to prevent spalling during testing or
at failure, which could harm internal portions of the vacuum system.

5. SPECIMEN FAILURE EVALUATION. The selected rock specimens were
evaluated for bending, tension and compression failure. Prior to the
SEM evaluation, representative strain-gaged specimens were tested to
failure in each mode outside the vacuum drawer to determine: (1) if
failure at each size in each mode was feasible; (2) the extent of spalling,
if any; and (3) where and how failure would occur on the specimen.

Problems encountered were associated with the compressive failure mode,
which proved to spall excessively and to fail unpredictably along the
entire length of the specimen. For the Westerly Blue granite and
Traprock shale specimens, the test size had to be reduced to 1/4 in. square
(in compression) so that failure could be reached within the 2000 lb
maximum load.
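
The size reduction follows from simple arithmetic: with the load capped at 2000 lb, the largest nominal compressive stress the stage can apply is the load divided by the cross-sectional area. The sketch below tabulates that limit for the three square cross sections used in this study. The function name is hypothetical, and the calculation assumes uniform axial stress with no end effects.

```python
MAX_LOAD_LB = 2000.0  # stage capacity quoted in the text


def max_compressive_stress_psi(side_in, load_lb=MAX_LOAD_LB):
    """Nominal axial stress P/A for a square cross section of given side."""
    area_sq_in = side_in ** 2
    return load_lb / area_sq_in


# Cross sections prepared for the study: 1/8, 1/4 and 1/2 in. square.
for side in (0.125, 0.25, 0.5):
    stress = max_compressive_stress_psi(side)
    print(f"{side:5.3f} in. square: up to {stress:,.0f} psi")
```

A 1/2-in. square specimen can therefore be taken only to 8,000 psi, while a 1/4-in. square specimen can be taken to 32,000 psi, which is consistent with the stronger rocks needing the smaller section.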

6. SEM FAILURE METHODS. The bending (flexure) failure mode was first
evaluated in the vacuum drawer by applying load to a specimen up to a
strain level approaching failure. At the point approaching failure, the
load application was slowed to approximately 0.3 mm/min. The notch area
was scanned during this load application. Slow load application was
continued until crack initiation, when the load was stopped and the crack
scanned. If the crack was partial, loading was applied again while the
crack tip was followed with a scan. Loading was halted periodically for
a side to side scan. For the bending failure mode, there were no
significant changes indicated on either side of the failure plane.

For  the  tensile  failure  mode  evaluation,  a  slight  seating  load  was 
applied  manually  to  the  specimen  before  placing  it  in  a  vacuum,  so  that 
the  specimen  would  not  rotate  during  load  application.  Since  this  test 
mode  builds  up  stress  prior  to  failure,  most  specimens  failed  rapidly, 
even at a very small load rate. In some cases, a scan was possible
before complete separation. When side scans were performed in this failure
mode,  such  secondary  phenomena  as  grain  separation  were  present. 

7. FAILURE SURFACE EVALUATION. After completion of the SEM failure
evaluation, one surface of each failed specimen was mounted on studs and
coated with gold-platinum. These surfaces were then evaluated by standard
SEM evaluation procedures and a standard stub stage. This evaluation,
together  with  the  SEM  failure  evaluation,  was  the  basis  of  the  failure 
analysis. 

8. FAILURE ANALYSIS. When the beams failed in a bending mode, both
intergranular and transgranular failure mechanisms were present, usually
in approximately equal distribution; however, different rock types
exhibited each failure mechanism to different degrees. The Bonne Terre
limestone and Westerly Blue granite exhibited approximately equal
distribution of the inter- and transgranular failure mechanisms. The
Traprock shale and Murphy marble beams primarily displayed transgranular
failure, with some intergranular failure. The Danby marble primarily
showed intergranular failure and some transgranular failure, while the
Berea sandstone exhibited a 100 percent intergranular failure mechanism.

The tensile failure test mode showed no preference for either rock
type or crystal/grain size relative to the failure mechanisms. Both
inter- and transgranular failure mechanisms were approximately equally
distributed for each rock type evaluated. These figures also indicate
the variation in crystal/grain size for the six rock types.
The Westerly Blue granite, Murphy marble, and Bonne Terre limestone
display good crystal cleavage planes. The Berea sandstone exhibits
surface wear on the individual sand grains.

As anticipated in the compression mode, transgranular failure
mechanisms were present due to the nature of the test. However, the Berea
sandstone and Bonne Terre limestone exhibited an unexpectedly excellent
intergranular failure mechanism. The Westerly Blue granite, Murphy marble,
and Traprock shale exhibited predominantly (95 percent) transgranular
failure; the Danby marble displayed both failure mechanisms, with
intergranular failure predominating.

Table  2  summarizes  the  failure  mechanisms  relative  to  test  mode 
and  rock  type. 

9. SUMMARY AND CONCLUSIONS. Selected rock samples were prepared
and tested to failure by bending, tension, and compression in the
vacuum stage of a scanning electron microscope. Load was applied very
slowly in order to observe crack initiation and growth. Crack growth
was observed visually and recorded by both [...].
The crack surfaces of the failed specimens were evaluated by standard
methods, and two evaluation techniques were used to determine the failure
mechanisms for each test mode and rock type studied.

Conclusions. Based on the techniques of stub evaluation and failure in
the SEM stage, the following statements apply only to those test modes
and rock materials studied herein:

a. Cross-section size differences had no effect on the failure
mode. The only benefit derived from studying several sizes was
facilitation of compression testing of granite and shale specimens.

b. The rock types evaluated in this study had no apparent effect
on the failure mode or the failure mechanisms.

c.  Crystal/grain  size  directly  and  significantly  influences  the 
failure  mechanisms  as  follows: 

(1)  Large  crystals/grains  -  failure  was  primarily  transgranular 
for  each  test  mode. 

(2)  Small  crystals/grains  -  failure  was  primarily  intergranular 
for each test mode.

d. Cementing agents have little or no effect on the gross failure
mechanisms; however, failure in the cementing agent was exclusively
transgranular.

10. RECOMMENDATIONS. This study has proved the feasibility and
usefulness of applying a metallurgical research tool to geologic
materials. The present study, in conjunction with studies by Brace of
specimen  preparation  techniques,  could  yield  valuable  information  in 
the  area  of  geophysics  and  earthquake  analysis.  Studies  relative  to 
slickenside  development  in  clay  shales  and  other  shear  phenomena  of  soil 
and rock could be advanced by this approach.

PHYSICAL PROPERTY DATA

SUMMARY OF FAILURE MECHANISMS

Figure 1. Bending Stage.

ABSTRACT. The Phase II Secure Voice Program (P2SVP) will develop,
acquire and install a high quality, effective, long haul DOD secure
voice system that will serve up to 10,000 subscribers in the 1985 time
frame to provide requisite interoperability with strategic and tactical
systems. It replaces the Phase I Automatic Secure Voice Communications
(AUTOSEVOCOM) Network and Interim Conferencing for the National Military
Command System (NMCS).

The independent Army analysis was a unique effort because this
was the first time the Army was asked by the Secretary of Defense to
evaluate another agency's program.

The  Director,  Telecommunications  and  Command  and  Control  Systems, 
Office  of  the  Secretary  of  Defense  requested  the  Army  to  prepare 
Independent  Cost  Estimates  (ICE's)  of  the  P2SVP  alternatives  developed 
by  the  DCA  in  support  of  Development  Concept  Paper  (DCP)  #153.  These 
Independent  Cost  Estimates  were  to  be  prepared  for  the  Defense  Systems 
Acquisition  Review  Council,  Office  of  the  Secretary  of  Defense,  Cost 
Analysis  Improvement  Group  (DSARC,  OSD,  CAIG).  This  analysis  provided 
input  for  the  full-scale  engineering  development  decision  point. 

HQ, DARCOM established a Systems Study Group (SSG), chaired by
myself, consisting of representatives (multi- and inter-disciplinary)
from COA, CSA, ACC, ECOM, DCA, NSA, DCEC, DDR&E, DTACCS, and OSD.

This  SSG  generated  an  ICE  by  analyzing  the  Phase  II  computer  printouts 
at  the  Defense  Communications  Engineering  Center  (DCEC),  supported  by 
engineering  judgement,  mathematical  analysis,  expert  opinion  and 
historical  data.  These  estimates  were  prepared  in  accordance  with 
the  Army  Materiel  Guide  for  Organizing  and  Presenting  Cost  Studies, 
and  the  HQ,  Department  of  the  Army  Investment  and  O&S  Cost  Guides  for 
Army  Materiel  Systems. 

1.  INTRODUCTION.  An  analysis  of  P2SV  and  alternatives  was  made 
previously  by  the  Defense  Communications  Agency  (DCA)  in  the  form  of 
an  Economic  Analysis  Estimate  (EAE).  The  ICE  described  in  this  paper 
provides  an  independent  evaluation  of  the  costs  generated  in  that  EAE. 
Such  an  evaluation  is  a  normal  procedure  in  the  acquisition  of  Army 
materiel  systems.  Together  with  the  benefits  (effectiveness)  calculations 
made  in  the  EAE,  it  allows  a  ranking  of  the  candidate  systems  to  be  made 
and gives visibility to the decision maker of the trade-offs involved.

2. DESCRIPTION OF ALTERNATIVES. Independent Cost Estimates
were made on 4 alternatives: Worldwide Tenley, Narrowband, Wideband
and Hybrid Systems for the Phase II Secure Voice Program. Summary
descriptions are given.


TABLE 1 - SUMMARY DESCRIPTION OF ALTERNATIVES


I    WORLDWIDE TENLEY
     16 KBPS
     Modified Autovon CONUS
     TTC-39 Overseas
     Predominantly Wideband
     Tri-Tac Type COMSEC

II   NARROWBAND
     8 KBPS
     Modified Autovon Worldwide
     Bellfield COMSEC
     Red/Maroon Interface with Tri-Tac

III  WIDEBAND
     16 KBPS
     Modified Autovon CONUS
     TTC-39 Overseas
     Bellfield COMSEC CONUS
     Tri-Tac COMSEC Overseas

IV   HYBRID
     16 KBPS Overseas
     8 KBPS CONUS
     Modified Autovon CONUS
     TTC-39 Overseas
     Bellfield COMSEC CONUS
     Tri-Tac COMSEC Overseas


The  AN/TTC-39  is  a  family  of  modular  and  transportable  communication 
switching  systems  designed  to  provide  secure  automatic  switching  for 
tactical  voice  and  message  traffic.  The  family  consists  of  hybrid  circuit 
switches  varying  in  size  from  450  to  750  terminations  by  increments  of 
150  analog  or  digital  terminations  and  message  switches  equipped  for  25 
or  50  terminations. 

A more detailed description of the four alternatives is given
below:

A.  ALTERNATIVE  I.  The  Worldwide  Tenley  provides  for  16  KBPS  (Wideband) 
continuously  variable  slope  delta  modulation  (CVSD)  terminals  for  all  users. 
Clear,  secure  voice  capability  will  be  provided  from  the  same  16  KBPS 
terminal.  Leased  CONUS  autovon  switches  will  be  modified  to  emulate 
certain  AN/TTC-39  switch  features  and  the  government  owned  switches 
overseas  will  be  replaced  with  AN/TTC-39  type  switches.  Concentrations 
of  subscribers  will  be  provided  access  via  a  new  automatic  4-wire  Digital 
Access  Exchange  (DAX)  concentrator.  End-to-end  encryption  will  be  provided 
for all calls within the network, except for conferencing and NB/WB
conversions requiring red interfaces. Automatic remote electronic
cryptographic key distribution will be provided with the Tri-Tac Tenley
COMSEC concept in both CONUS and overseas.

B. ALTERNATIVE 2. Alternative 2 provides 8 KBPS Narrowband voice
processor terminals and Bellfield COMSEC in both CONUS and overseas
portions of the DCS. CONUS Autovon switches will be modified for
digital operation and Bernhardt KDC's will be used in CONUS and
overseas. A red interface KDC will be required for the DCS Bellfield
COMSEC to interoperate with the Tri-Tac Tenley COMSEC. Overseas, the
existing government-owned autovon switches will be modified for digital
operation. End-to-end voice encryption will be maintained on intra-DCS
calls since all users will have compatible terminals. However, calls
to Tri-Tac will require 8 to 32 KBPS voice interfaces that will prohibit
end-to-end encryption and insert voice degradation.

C. ALTERNATIVE 3. Alternative 3 provides a Worldwide Wideband
(16 KBPS) system using Bellfield COMSEC in CONUS and Tenley COMSEC
overseas. As opposed to the Tenley alternative, it will not have COMSEC
functions at each modified CONUS autovon switch. Instead, up to 3
stand-alone Bernhardt KDC's will be dispersed throughout CONUS to serve the
CONUS DCS. CONUS autovon switches will be modified to provide digital
service. The CONUS voice terminal will be procured to operate in the
Bellfield COMSEC mode. Overseas, this alternative will require a special
interface KDC to allow interoperation of the CONUS Bellfield and the
overseas Tenley key distribution systems. Voice interoperability with
Tri-Tac subscribers and end-to-end encryption will be available.


D. ALTERNATIVE 4. Alternative 4, the Hybrid alternative, provides
8 KBPS Narrowband Voice Terminals with Bellfield COMSEC in CONUS and
16 KBPS voice terminals with Tenley COMSEC overseas. The Bellfield
COMSEC in CONUS will be achieved with Bernhardt KDC's. The CONUS Secure
voice terminals will be the product of a separate Narrowband development.
CONUS autovon switches will be modified for digital operation. Overseas,
the program will be identical to the Wideband alternative, except that
an interface will be required between the two dissimilar voice terminals
of each geographic area. This will preclude end-to-end encryption of
voice calls between CONUS and overseas DCS or CONUS DCS and Tri-Tac, and
will introduce noticeable voice degradation for these calls.


3. METHODOLOGY. The methodologies used in this analysis included
cost estimating relationships, regression analysis, learning curve,
engineering estimates, analogy, Delphi, cost factors, complexity factors,
contractor quotes, previous experience, and subjective judgement.

The  methodology  employed  for  the  investment  portion  of  the  ICE 
consisted  of  the  formulation  of  the  equipment  requirements  package, 
research  of  available  cost  data,  determination  of  hardware  costs  by 
analogy  and  support  costs  from  historical  information  and  cost 
estimating  guidelines.  The  cost  data  elements  of  the  investment 
analysis include hardware, military construction, engineering,
installation and testing, material, initial spares, test equipment, data,
training, packing, packaging and transportation.

The operation and support methodology consisted of cost estimating
relationships, computer models, expert opinion, analogy, contractor
quotes, cost factors, and exponential regression analysis. The cost
data elements of the O&S analysis consisted of personnel, consumption,
training, integrated logistics support, maintenance, procurement of
switch modification, transportation, recurring spares, leasing and
utilities.

The methodology for the R&D cost estimates was expert opinion.
Sixty-four individual R&D tasks were analyzed using a modified Delphi
technique and a computer routine. The cost data elements of the R&D
analysis included engineering, tooling and prototypes.

A.  As  an  example  of  the  mathematical  techniques  used  in  estimating 
costs,  an  analysis  of  CONUS  transmission  costs  is  given.  These  AT&T 
leased  lines  will  be  used  for  digital  rather  than  the  usual  analog 
transmission;  thus  there  was  no  relevant  experience  to  obtain  data. 

Two  factors  were  involved  in  the  analysis,  the  first  of  which 
was  the  increase  in  the  number  of  digital  service  areas  expected.  This 
is  expected  to  result  in  a  linear  decrease  in  total  transmission  costs 
of  2%/year  for  10  years.  The  second  factor  anticipates  a  reduction 
in  costs  for  providing  digital  transmission  due  to  technological 
advances  and  increased  equipment  production.  This  decrease  is  expected 
to  start  in  1980  and  is  expressed  by  the  exponential  regression, 

DC=  1/2  (1  +  e 

Where  0<t<10  corresponds  to  the  years  1980  to  1990.  This 
expression  results  from  an  exponential  regression  analysis  using  all 
available  information  on  present  and  past  transmission  leasing  costs. 

B.  The  approach  to  estimating  Operating  and  Support  (O&S)  costs 
was  as  follows.  Operator  costs  were  calculated  by  multiplying  the 
number  of  operators  required  for  each  equipment  by  the  annual  pay  and 
allowance  for  the  operator's  grade  level. 

Maintenance  costs  were  calculated  by  multiplying  the  cost  per 
active  maintenance  man-hour  by  the  total  annual  maintenance  hours 
per  equipment.  Total  annual  maintenance  man-hours  were  calculated  by 


AMMH  =  HOP  (MTTR/MTBF), 

where:  AMMH = Annual Maintenance Man-hours
        HOP  = Hours of Operation per Year
        MTTR = Mean Time to Repair
        MTBF = Mean Time Between Failures

Depot overhaul costs for labor and material were calculated by
multiplying the depot overhaul cost by the overhaul rate to equal the
depot cost per unit per year. The overhaul rate indicated how often
the unit was expected to be sent back to depot for overhaul. The
depot overhaul cost was estimated by

DOC = 0.809 (DOR) (UC)

where:  DOC = Depot Overhaul Cost/Year
        UC  = Unit Hardware Cost
        DOR = Depot Overhaul Rate
        Standard Error = +60%, -37%
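
The O&S relationships above reduce to a few multiplications, sketched below for illustration. All function names and the sample inputs are hypothetical. Treating the quoted standard error (+60%, -37%) as multiplicative bounds on DOC is an assumption made here, not something the text states.

```python
def operator_cost(n_operators, annual_pay_and_allowance):
    """Operators required times annual pay and allowance for the grade."""
    return n_operators * annual_pay_and_allowance


def annual_maintenance_man_hours(hop, mttr, mtbf):
    """AMMH = HOP * (MTTR / MTBF)."""
    return hop * (mttr / mtbf)


def maintenance_cost(cost_per_man_hour, hop, mttr, mtbf):
    """Cost per active maintenance man-hour times annual man-hours."""
    return cost_per_man_hour * annual_maintenance_man_hours(hop, mttr, mtbf)


def depot_overhaul_cost(dor, unit_cost):
    """DOC = 0.809 * DOR * UC (per unit per year)."""
    return 0.809 * dor * unit_cost


def depot_overhaul_cost_band(dor, unit_cost):
    """(low, expected, high) DOC; the band reading is an assumption."""
    doc = depot_overhaul_cost(dor, unit_cost)
    return (1.0 - 0.37) * doc, doc, (1.0 + 0.60) * doc


# Hypothetical inputs, chosen only to exercise the formulas.
ammh = annual_maintenance_man_hours(hop=8760.0, mttr=0.5, mtbf=1000.0)
low, expected, high = depot_overhaul_cost_band(dor=0.1, unit_cost=1000.0)
print(f"AMMH = {ammh:.2f} man-hours per year")
print(f"DOC  = {expected:.1f} (band {low:.1f} to {high:.1f})")
```

Keeping each relationship in its own function makes it easy to substitute equipment-specific values for HOP, MTTR, MTBF, DOR and UC.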

C. Cost Estimating Relationships (CER) were used to estimate
costs for various equipments. For example, the CER used for the TTC
automatic switching equipment was

$Y_1 = 27284.7 + 0.002\,X_1^{(\cdot)} - 0.125\,X_2^{(\cdot)} + 24.898\,X_3^{(\cdot)}$

where:  $Y_1$ = Acquisition Cost
        $X_1$ = Weight
        $X_2$ = Volume
        $X_3$ = Number of Lines

and $(\cdot)$ marks an exponent that is not legible in this copy.

4.  UNCERTAINTY  ANALYSIS.  In  all  cases  of  projected  cost  estimates 
some  degree  of  uncertainty  will  exist  and  it  is  therefore  advisable  to 
state  projected  cost  estimates  in  terms  of  most  likely  value,  lowest 
value,  and  the  most  pessimistic  (highest)  value.  The  most  likely 
value  would  be  that  value  normally  used  in  planning,  programming  and 
budgeting. 


The  ratios  of  high  and  low  values  to  most  likely  (taken  as  1) 
are  given  in  Table  2  below  for  the  preferred  alternative  1  for  R&D 
and  O&S  costs. 


TABLE 2 - UNCERTAINTY ANALYSIS (ALTERNATIVE 1)


          LOW      MOST LIKELY      HIGH

R&D       .957          1           1.024
O&S       .850          1           2.054

The uncertainty in the investment costs was analyzed for the
major equipments. The uncertainties are given in terms of percentages
of the most likely costs.


TABLE 3 - INVESTMENT UNCERTAINTY ANALYSIS


EQUIPMENT                               UNIT COST      -        +

AN/TTC-39 Switch                         $1,860K      18%       8%
Tenley Family                               298K      15%      15%
Bellfield Family                            292K      25%      25%
Loran C                                      20K       5%      .5%
Dax (Concentrator)                           58K      15%      40%
DSVT                                        4.3K      10%      10%
Goldwine Mod                               10.9K      60%      10%
Conference Directors                        273K      50%      50%
Transmission Equipment                                15%      15%

SERVICE AND SUPPORT

Engineer, Install and Test                            10%     100%
Repair Parts                                          15%      15%
Test Equipment                                        20%      20%
Data                                                  20%      50%
Packing, Packaging & Transportation                   15%      25%
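
The two ways uncertainty is expressed in Tables 2 and 3 (ratios to the most likely value, and minus/plus percentages) can both be converted to a low/high cost range. The sketch below is illustrative only: the helper names are hypothetical, a few Table 3 rows are copied in as sample data, and the most likely value of 100 used with the Table 2 ratios is a placeholder rather than a figure from this paper.

```python
def range_from_ratios(most_likely, low_ratio, high_ratio):
    """Table 2 style: bounds given as ratios to the most likely value."""
    return most_likely * low_ratio, most_likely * high_ratio


def range_from_percent(most_likely, minus_pct, plus_pct):
    """Table 3 style: bounds given as minus/plus percentages."""
    low = most_likely * (1.0 - minus_pct / 100.0)
    high = most_likely * (1.0 + plus_pct / 100.0)
    return low, high


# Table 2 ratios for Alternative 1: (low, high) relative to 1.
TABLE_2_RATIOS = {"R&D": (0.957, 1.024), "O&S": (0.850, 2.054)}

# Selected Table 3 rows: (unit cost in $K, minus %, plus %).
TABLE_3_ROWS = {
    "AN/TTC-39 Switch": (1860.0, 18, 8),
    "Tenley Family": (298.0, 15, 15),
    "Bellfield Family": (292.0, 25, 25),
    "DSVT": (4.3, 10, 10),
}

for name, (cost, minus, plus) in TABLE_3_ROWS.items():
    low, high = range_from_percent(cost, minus, plus)
    print(f"{name:20s} {low:9.1f}K to {high:9.1f}K")

for element, (lo_r, hi_r) in TABLE_2_RATIOS.items():
    low, high = range_from_ratios(100.0, lo_r, hi_r)  # placeholder base
    print(f"{element}: {low:.1f} to {high:.1f} per 100 most likely")
```

The asymmetric rows (for example the AN/TTC-39 switch at -18%, +8%) show why separate minus and plus arguments are kept rather than a single symmetric tolerance.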

5. SENSITIVITY ANALYSIS. Cost sensitivity analysis is a technique within
the context of both individual system and force structure cost analysis.
It involves the systematic examination of the effects of changes in total
force structure cost resulting from variations in characteristics, size,
and composition of force. The variables considered in conducting the
sensitivity analysis were the number of subscribers, manning levels,
changes in terminals, logistics cost, CONUS leasing costs, and planning
horizons.

6. COST BENEFIT ANALYSIS. By using standard methods of measuring
benefits (measures of effectiveness), benefit/cost ratios were
calculated for the 4 alternatives. The values are given below:


TABLE 4 - COST BENEFIT ANALYSIS


Alternative       Tenley     Narrowband     Wideband     Hybrid

Benefit/Cost       484          305           378         296

7. SUMMARY COSTS. The table below gives the summary costs in both
constant and inflated FY76 dollars. At an inflated cost of over a billion
dollars, this is a large but not untypical program for our analysis and
evaluation.


TABLE 5 - ICE P2SV GENERAL COST SUMMARY


(Constant 76 $ M)

ALTERNATIVES       R&D      Investment       O&S        TOTAL

I                  37.6       179.8         569.7       787.1
II                 39.3       209.3         495.2       743.8
III                37.6       173.1         555.5       766.2
IV                 38.8       248.8         570.4       858.0

(Inflated 76 $ M)

ALTERNATIVES       R&D      Investment       O&S        TOTAL

I                  44.2       235.9        1026.2      1306.3
II                 45.6       279.3         882.3      1207.2
III                44.2       230.2        1000.9      1275.3
IV                 45.7       329.2        1017.1      1392.0
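
The figures in Tables 4 and 5 lend themselves to a quick consistency check and ranking. The sketch below is a reader's aid and not part of the original analysis: the variable and function names are hypothetical, and the Alternative I constant-dollar R&D entry is deliberately treated as unknown and recovered from the printed total.

```python
# (R&D, investment, O&S, total) in millions of FY76 dollars, per Table 5.
# The Alternative I constant-dollar R&D entry is treated as unknown here
# and recovered from the printed total, as a consistency check.
CONSTANT = {
    "I": (None, 179.8, 569.7, 787.1),
    "II": (39.3, 209.3, 495.2, 743.8),
    "III": (37.6, 173.1, 555.5, 766.2),
    "IV": (38.8, 248.8, 570.4, 858.0),
}
INFLATED = {
    "I": (44.2, 235.9, 1026.2, 1306.3),
    "II": (45.6, 279.3, 882.3, 1207.2),
    "III": (44.2, 230.2, 1000.9, 1275.3),
    "IV": (45.7, 329.2, 1017.1, 1392.0),
}
# Table 4 ratios; I = Tenley, II = Narrowband, III = Wideband, IV = Hybrid.
BENEFIT_COST = {"I": 484, "II": 305, "III": 378, "IV": 296}


def fill_missing_rd(row):
    """Recover a missing R&D entry as total - investment - O&S."""
    rd, investment, o_and_s, total = row
    if rd is None:
        rd = round(total - investment - o_and_s, 1)
    return rd, investment, o_and_s, total


def totals_consistent(table, tol=0.05):
    """True if R&D + investment + O&S matches each printed total."""
    return all(abs(rd + inv + oas - total) <= tol
               for rd, inv, oas, total in table.values())


constant = {alt: fill_missing_rd(row) for alt, row in CONSTANT.items()}
ranking_by_ratio = sorted(BENEFIT_COST, key=BENEFIT_COST.get, reverse=True)
ranking_by_cost = sorted(constant, key=lambda alt: constant[alt][3])

print("Recovered R&D, Alternative I:", constant["I"][0])
print("Totals consistent:", totals_consistent(constant),
      totals_consistent(INFLATED))
print("Best benefit/cost first:", ranking_by_ratio)
print("Lowest constant-dollar total first:", ranking_by_cost)
```

The two rankings differ: the alternative with the highest benefit/cost ratio is not the one with the lowest total cost, which is the kind of trade-off the ICE is meant to make visible to the decision maker.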

8. CONCLUSION. In this presentation I have attempted to give the
highlights of the Army's independent analysis of the P2SVP, as well as
some of the complementary calculations used in an economic analysis that
are needed in the decision acquisition process.

ABSTRACT. The solution of the class of problems governed by
a set of first order linear differential equations, subject
to a set of linear constraints, and the minimization of a
defined quadratic performance index is presented. The number
of differential equations must be greater than the number
of constraints; otherwise, there is a unique solution and
control is not possible. The solution is considered as
known once the correct initial conditions are found; a
number of initial value methods are available to solve
linear differential equations. Only discrete controls
are considered here, depicting the real world where
continuously variable controls are not always present. Using the
above, systems of the open loop type are examined.

The method consists of superposition of linearly independent
particular solutions to get the optimal solution. The
particular solutions are generated using a power series
integration technique on a perturbed set of arbitrarily chosen
initial conditions. The superposition constants are determined
so that the solution both meets the constraints and
minimizes the quadratic performance index. The minimum point
is found using a method developed by Childs and M^n for
the explicit minimum solution to a set of quadratic equations
subject to a set of linear constraints.

1. INTRODUCTION. The solution to a class of control problems with discrete
controls is presented. The class of problems examined are those governed by
a set of n first order linear ordinary differential equations, subject to a
set of m linear constraints (m<n), wherein a given quadratic performance
index is to be minimized. The discrete controls appear in the solution as
initial values of the differential equations. Those initial values which are
unspecified by constraints are determined optimally by minimizing the given
quadratic performance index.

The letter y is used for an n element state variable vector which is
assumed to be a function of the independent variable t, time. The dot (˙)
is used to denote the total derivative with respect to t. The general set
of first order linear ordinary differential equations is written as

$\dot{y} = L y + f, \quad t \in [0, T]$    (1.1)

where L is an n by n coefficient matrix whose elements may be constants or
functions of time, f is an n element vector of forcing functions, and [0, T]
is the time interval of interest. The solution of equation (1.1) is subject
to the linear equality boundary conditions or constraints

$B_i\, y(t_i) = b_i, \quad i = 1, \ldots, m$    (1.2)

where $B_i$ represents the boundary condition operator that specifies a linear
combination of elements of the state vector. The ith boundary value, $b_i$,
is specified at the specified value of time, $t_i$. A quadratic performance
index, h, where

$h = \int_0^T y' M y \, dt$    (1.3)

is to be minimized. The n by n matrix M is symmetric and a known function of
time, t. The prime ( )' is used to indicate the transpose of a vector or
matrix. The above three equations define the basic problem.

The solution to the problem is uniquely defined once the state, y, is known at any time t. The solution is considered as known once the correct initial conditions, y(0), are known. The y(0) vector gives the desired control parameters, and it can be used with the differential equation (1.1) to generate an accurate solution for y as a function of t. This is due to the availability of a variety of initial value differential equation problem solvers for today's digital computers.

The solution method is a superposition of solutions, a "shooting method" [6]. The usual methods of solving similar control problems involve the use of Lagrange multipliers, Hamiltonians, co-state equations, etc., which are unnecessary in the method presented in this paper [5]. The techniques used in the usual methods require a large amount of mathematical gymnastics in the solution process.

2. A SHOOTING METHOD. A particular solution of equation (1.1) is a solution for a particular set of initial conditions. We define such a solution as

$$\dot{p}^{(k)} = L p^{(k)} + f \tag{2.1}$$

where the superscript, k, is an index which denotes the kth particular solution. The state vector, y, is determined by the superposition of particular solutions, and is expressed as

$$y = P a \tag{2.2}$$

where the kth column of the matrix P is the state vector $p^{(k)}$ of equation (2.1) and the vector a is the vector of superposition constants. The index, k, for the vector a and the columns of P varies from zero to r, where r is the number of differential equations minus the number of known initial conditions. Equation (2.2) can be rewritten as

$$y = \sum_{k=0}^{r} p^{(k)} a_k \tag{2.3}$$

If equation (2.1) is multiplied by $a_k$ and summed over k,

$$\sum_{k=0}^{r} \dot{p}^{(k)} a_k = \sum_{k=0}^{r} L p^{(k)} a_k + \sum_{k=0}^{r} f a_k \tag{2.4}$$

Rewriting after factoring out L and f from the summations (since they are not indexed by k) and substituting equation (2.3) and the derivative of equation (2.3) with respect to t into equation (2.4) gives

$$\dot{y} = L y + f \sum_{k=0}^{r} a_k \tag{2.5}$$

Comparing equations (1.1) and (2.5) establishes a constraint which the superposition constants must meet:

$$\sum_{k=0}^{r} a_k = 1.0 \tag{2.6}$$

The traditional superposition of homogeneous solutions on a single particular solution does not have a similar constraint. However, because we superimpose particular solutions, we need to program only one set of equations for each problem.

Independence of Solutions and Boundary Value Constraints. The reason for the superposition of the particular solutions is to satisfy the boundary conditions or constraints. This requires all r+1 subsets of the r particular solutions to be linearly independent. To ensure this, P(0) is created using the perturbation strategy: First, arbitrary estimates are made of the r unknown values of y(0) and this vector is used for $p^{(0)}(0)$. Second, columns 1 through r of P(0) are generated by making each column the same as $p^{(0)}(0)$, except that each has one nonzero perturbation from one of the estimated elements of $p^{(0)}(0)$. Each estimated element is perturbed in one and only one column. This strategy gives the desired independence.

The boundary conditions are of the form specified in equation (1.2). For control problems, these boundary conditions are usually initial conditions, but this is not required. As stated previously, r denotes the number of elements of y(0) not uniquely specified by equation (1.2), and thus, (n-r) elements of y(0) are uniquely specified. If m is not equal to (n-r), then there are m-(n-r) boundary conditions at times greater than zero. Substitution of (2.2) into (1.2) gives

$$q_i\bigl(P(t_i)\, a\bigr) = b_i, \qquad i = 1, 2, \ldots, m \tag{2.7}$$

which can be rewritten for linear operators as

$$\sum_{k=0}^{r} q_i\bigl(p^{(k)}(t_i)\bigr)\, a_k = b_i \tag{2.8}$$

Of these m linear equations, (n-r) specify known values of y(0) and m-(n-r) specify constraints on the unknown values of y(0) in terms of the (r+1) unknown superposition constants, the $a_k$. With the addition of constraint equation (2.6), there are m-(n-r)+1 constraints with (r+1) unknowns. Since the problem statement declares that m is less than n, it is evident that m-(n-r)+1 is less than r+1, and thus it is an underdetermined system. Therefore, the $a_k$'s are not uniquely specified, and we can choose them to minimize the performance index of equation (1.3).

Optimizing on the Basis of the Quadratic Performance Index. The a vector is now included in the performance index by the substitution of equation (2.2) into equation (1.3), which gives

$$h = \int_0^T a' P' M P a \, dt \tag{2.9}$$

The (r+1) by (r+1) matrix A is defined by:

$$A = \int_0^T P' M P \, dt \tag{2.10}$$

It is possible to rewrite equation (2.9) as

$$h(a) = a' A a \tag{2.11}$$

The method that is used to solve for A is to calculate the solution of the initial value problem

$$\dot{A} = P' M P, \qquad A(0) = 0 \tag{2.12}$$

The superposition equation (2.3) requires that (r+1)*n first order linear ordinary differential equations be integrated and the matrix A, which is symmetric because the matrix M is symmetric, may be determined by integrating an additional (r+1)*(r+2)/2 first order linear ordinary differential equations.

In solving for the optimum a vector, the explicit formula developed by Childs and Maron (1975) is utilized. This formula states that the solution for a such that

$$h(a) = a' A a = \text{minimum} \tag{2.13}$$

subject to

$$K a = c \tag{2.14}$$

is

$$a = a_p - N (N' A N)^{-1} N' A \, a_p \tag{2.15}$$

where K is m-(n-r)+1 by (r+1) and of rank m-(n-r)+1, $a_p$ is a particular solution of equation (2.14), and the columns of N form a basis for the null space of K. The $(\ )^{-1}$ in equation (2.15) denotes a matrix inverse. By using appropriate matrix operations, it is possible to transform equation (2.14) into the equivalent system

$$[\, I \mid G \,]\, a = d \tag{2.16}$$

This can be used to define $a_p$ and N as

$$a_p = \begin{bmatrix} d \\ 0 \end{bmatrix} \quad \text{and} \quad N = \begin{bmatrix} -G \\ I \end{bmatrix} \tag{2.17}$$

The I's in equations (2.16) and (2.17) are identity matrices of appropriate order.

3. AN EXAMPLE. The first problem chosen is


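$$\ddot{x} + 0.2\,\dot{x} + x = u_1 + u_2 t, \qquad t \in [0, 10] \tag{3.1}$$

subject to

$$x(0) = 0, \qquad x(10) = 1, \qquad \dot{x}(0) = 0 \tag{3.2}$$

and

$$h = \int_0^{10} (x^2 + \dot{x}^2)\, dt = \text{minimum} \tag{3.3}$$

In state variable form, this can be restated as

$$\dot{y}_1 = y_2, \qquad \dot{y}_2 = -y_1 - 0.2\,y_2 + y_3 + y_4 t, \qquad \dot{y}_3 = 0, \qquad \dot{y}_4 = 0 \tag{3.4}$$

subject to

$$y_1(0) = 0, \qquad y_1(10) = 1, \qquad y_2(0) = 0 \tag{3.5}$$

where

$$h = \int_0^{10} y' M y \, dt \tag{3.6}$$

and

$$M = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{pmatrix} \tag{3.7}$$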


Using  an  accuracy  of  10  ^  and  evaluating  power  series  to  10  terms  results  in 
the  following  solution 


$y_1(0) = 0.$
$y_2(0) = 0.$
$y_3(0) = -0.105453$
$y_4(0) = 0.115041$

The $y_3$ and $y_4$ elements are the forcing function constants or control. The solution for $y_1$ and $y_2$ over the interval [0,10] is given in Table 1.
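The procedure of Section 2 can be sketched for this example in a few lines of Python. The sketch below is an illustration under stated assumptions, not the authors' code: it uses a fixed-step fourth-order Runge-Kutta integrator instead of the power series mentioned above, the function names (`deriv`, `rk4`, `solve_example`), the guesses, the perturbation size `delta` and the step count are all hypothetical choices, and the constraint system is written as K a = c. Because K is 2 by 3 here, its null space is spanned by the cross product of its two rows, so the inverse in (2.15) reduces to a scalar division.

```python
def deriv(t, z):
    """Right-hand side for three particular solutions (4 states each),
    followed by the 3x3 matrix A of (2.12), flattened row by row."""
    cols = [z[4*k:4*k + 4] for k in range(3)]
    out = []
    for p in cols:
        # State form (3.4): y1' = y2, y2' = -y1 - 0.2*y2 + y3 + y4*t, y3' = y4' = 0
        out += [p[1], -p[0] - 0.2*p[1] + p[2] + p[3]*t, 0.0, 0.0]
    for pj in cols:
        for pk in cols:
            # A' = P' M P with M = diag(1, 1, 0, 0)
            out.append(pj[0]*pk[0] + pj[1]*pk[1])
    return out

def rk4(z, t0, t1, steps):
    """Classical fixed-step Runge-Kutta integration from t0 to t1."""
    h = (t1 - t0) / steps
    t = t0
    for _ in range(steps):
        k1 = deriv(t, z)
        k2 = deriv(t + h/2, [a + h/2*b for a, b in zip(z, k1)])
        k3 = deriv(t + h/2, [a + h/2*b for a, b in zip(z, k2)])
        k4 = deriv(t + h, [a + h*b for a, b in zip(z, k3)])
        z = [a + h/6*(b + 2*c + 2*d + e)
             for a, b, c, d, e in zip(z, k1, k2, k3, k4)]
        t += h
    return z

def dot(u, v):
    return sum(p*q for p, q in zip(u, v))

def solve_example(guess=(0.1, 0.1), delta=1.0, steps=2000):
    # P(0): column 0 holds the guesses; columns 1 and 2 perturb y3(0) and y4(0).
    P0 = [[0.0, 0.0, guess[0], guess[1]],
          [0.0, 0.0, guess[0] + delta, guess[1]],
          [0.0, 0.0, guess[0], guess[1] + delta]]
    z = rk4([v for col in P0 for v in col] + [0.0]*9, 0.0, 10.0, steps)
    A = [z[12 + 3*j:15 + 3*j] for j in range(3)]
    # Constraints K a = c: sum of a_k = 1 (2.6) and y1(10) = 1 (3.5).
    K = [[1.0, 1.0, 1.0], [z[0], z[4], z[8]]]
    c = [1.0, 1.0]
    # Particular solution a_p = K'(K K')^{-1} c, using a 2x2 solve.
    g11, g12, g22 = dot(K[0], K[0]), dot(K[0], K[1]), dot(K[1], K[1])
    det = g11*g22 - g12*g12
    w = [(g22*c[0] - g12*c[1])/det, (g11*c[1] - g12*c[0])/det]
    a_p = [w[0]*K[0][i] + w[1]*K[1][i] for i in range(3)]
    # Null-space basis of K: cross product of its two rows.
    N = [K[0][1]*K[1][2] - K[0][2]*K[1][1],
         K[0][2]*K[1][0] - K[0][0]*K[1][2],
         K[0][0]*K[1][1] - K[0][1]*K[1][0]]
    AN = [dot(row, N) for row in A]
    Aap = [dot(row, a_p) for row in A]
    s = dot(N, Aap) / dot(N, AN)   # (N'AN)^{-1} N'A a_p is a scalar here
    a = [a_p[i] - s*N[i] for i in range(3)]   # equation (2.15)
    y0 = [sum(P0[k][i]*a[k] for k in range(3)) for i in range(4)]
    return y0, a, A, K, N
```

Calling `solve_example()` returns the optimal initial vector y(0) as its first element; its third and fourth entries are the control constants and can be compared with the values reported above. The result should not depend on the guesses or on `delta`, since these only change the basis of particular solutions.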

4. CONCLUSIONS. A direct method has been shown for the solution of linear ordinary differential equations subject to minimization of a quadratic performance index and multipoint boundary values. The method avoids the necessity of Lagrange multipliers and other similar tools.

The method can easily be incorporated into boundary value codes. Most problems will have nonlinearities which can be handled in the usual manner [3], [6].

TABLE 1

NUMERICAL SOLUTIONS

Time       y_1        y_2
 0.0      0          0
 0.5     -0.010     -0.035
 1.0     -0.028     -0.031
 1.5     -0.035      0.006
 2.0     -0.018      0.066
 2.5      0.031      0.130
 3.0      0.111      0.186
 3.5      0.213      0.219
 4.0      0.325      0.225
 4.5      0.434      0.205
 5.0      0.527      0.166
 5.5      0.598      0.118
 6.0      0.646      0.075
 6.5      0.675      0.045
 7.0      0.694      0.034
 7.5      0.713      0.043
 8.0      0.740      0.068
 8.5      0.782      0.102
 9.0      0.841      0.135
 9.5      0.916      0.161
10.0      1.000      0.173

y_3 = -0.105
y_4 = +0.115  (constants)
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On  Generalized  Feller  Equation 
Siegfried H. Lehnigk

Physical Sciences Directorate, US Army Missile Command
Redstone Arsenal, AL


ABSTRACT

The generalized Feller equation

$$\mathcal{L}(z) \equiv A z_{xx} + B z_x + C z - z_t = 0, \qquad z = z(x,t), \quad x > 0, \quad t > 0,$$

with the coefficients

$$A(x) = \alpha x^{1+\lambda}, \qquad \alpha > 0, \quad \lambda \in R, \quad \lambda < 1,$$

$$B(x) = \beta_1 x^{\lambda} + \beta_2 x,$$

$$C(x) = \rho x^{\lambda - 1} + \beta_2, \qquad \rho = \lambda[\beta_1 - \alpha(1 + \lambda)],$$

will be considered. The choice of $\rho$ makes $\mathcal{L}(z) = 0$ a Fokker-Planck equation.

Solutions of $\mathcal{L}(z) = 0$ will be derived for given initial and/or boundary conditions. The derivation of initial condition solutions is based on a basic solution of $\mathcal{L}(z) = 0$ and its adjoint.

The complete paper is published elsewhere.
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A  PERTURBATION  METHOD  FOR  FREE  BOUNDARY 
PROBLEMS OF ELLIPTIC TYPE*

B. A. Fleishman and Thomas J. Mahar**
Department of Mathematical Sciences
Rensselaer Polytechnic Institute
Troy, New York 12181


ABSTRACT. Nonlinear partial differential equations (PDE's) arise in many scientific contexts, and boundary value problems (BVP's) for such equations present formidable computational difficulties. Thus analytical techniques for approximating the solutions of such problems have practical significance.

A formal perturbation method is described here for approximating solutions of certain BVP's for elliptic PDE's containing discontinuous nonlinearities. To illustrate, we treat in detail the BVP

$$P(\varepsilon): \qquad u_{xx} + u_{yy} + f(u) = 0 \quad \text{in } S:\ 0 < x < 1,$$

$$u(0,y) = \varepsilon h(y), \qquad u_x(1,y) = 0 \quad \text{for } -\infty < y < \infty,$$

where $\varepsilon$ is a small parameter, h is periodic and uniformly bounded, and f is a step-function: $f(u) = 0$ for $u < \mu$, $f(u) = 1$ for $u \ge \mu$ ($\mu$ a positive constant). u and $\partial u/\partial n$ are to be continuous across any "free boundary" $u = \mu$. If $0 < \mu \le 1/4$, problem P(0) is shown to have at least one non-trivial solution $u_0 = u_0(x)$ such that $u_0(\bar{x}) = \mu$ $(0 < \bar{x} < 1)$. For $\mu \in (0, 1/4)$ an approximate solution u(x,y) of $P(\varepsilon)$ involving a free boundary in S is then sought in the form $u(x,y) = u_0(x) + \varepsilon\tilde{u}(x,y)$, with the free boundary assumed to be $x = \bar{x} + \varepsilon g(y)$.

Two examples are considered, $h(y) = \cos y$ and h a trigonometric polynomial, in which the linear (variational) equation for $\tilde{u}$ may be solved by separation of variables.

An unusual feature of our procedure is that the variational equation for $\tilde{u}$ contains a delta-function coefficient, because in the original equation f is a step-function in u.

1. INTRODUCTION. Nonlinear partial differential equations (PDE's) arise in many scientific contexts, and boundary value problems (BVP's) for such equations present formidable computational difficulties. Thus analytical techniques for approximating the solutions of such problems have practical significance.

* Research supported by U. S. Army Research Office.

** Present address: Courant Institute of Mathematical Sciences, New York University, 251 Mercer Street, New York, New York 10012 (U.S.A.)


We illustrate here a perturbation method applicable to certain BVP's for elliptic PDE's of the form

$$\Delta u + f(x, u) = 0 \tag{1}$$

where $x = (x_1, \ldots, x_n)$ is a point in $R^n$, $\Delta$ denotes the Laplacian operator, u is a real scalar variable, and f is a piecewise-continuous function of $x_1, \ldots, x_n$ and u. When f has jump discontinuities with respect to u, among the interfaces across which f changes abruptly there may be so-called "free boundaries" which are not known a priori but must be found along with the solution u = u(x).

Suppose f is a step-function in u and depends also on m of the independent variables, say, $x_1, \ldots, x_m$, where $0 \le m < n$. Let D be a fixed region in $R^n$ whose bounding surfaces are independent of $x_{m+1}, \ldots, x_n$.


Now consider a BVP for (1) on D, denoted by $P(\varepsilon)$, in which a small parameter $\varepsilon$ occurs in the boundary conditions in such a way that the "reduced" problem P(0) does not involve $x_{m+1}, \ldots, x_n$. If a solution $u_0(x_1, \ldots, x_m)$ of P(0) is obtainable, we seek a solution of $P(\varepsilon)$ in the perturbed form $u = u_0 + \varepsilon\tilde{u}$, with free boundaries (if any) which are perturbations of free boundaries of P(0). As we shall see in the specific problem considered below, for certain boundary data it is easy to find $\tilde{u}$ and the perturbed free boundary.


The unusual mathematical feature of this procedure is that we perturb about a surface of discontinuity, which introduces a delta function into the (variational) equation satisfied by $\tilde{u}$. Our development is formal; assuming that the solution we seek exists and that it can be closely approximated by an expression of the form $u_0 + \varepsilon\tilde{u}$, etc., we calculate $\tilde{u}$ and the modified free boundary.


Free boundary problems for equations similar to (1) occur in plasma physics; in [1], for example, the authors consider equations of the form $Lu + f(x,u) = 0$, where L is an elliptic operator and f is, however, piecewise-linear in u, not discontinuous. Free boundary problems for equations of the form div (K grad u) = 0, where K is a piecewise-continuous function (which arise in the equilibrium Stefan problem [2] and govern certain diffusion and metallurgical processes) are also being investigated by the method illustrated here.

Besides occurring naturally, problems with discontinuous nonlinearities are sometimes introduced as approximations (e.g., see [1]) to problems with smooth nonlinearities (which, in general, cannot be solved explicitly). The authors are investigating the feasibility of deriving approximate solutions of BVP's for equations of type (1) in which f is bounded and has smooth dependence on u, by first replacing the smooth function f with one which is a step-function in u, then employing the procedure described here to treat the approximating problem. In this connection it is important to note that if the perturbation procedure is applied directly to an equation of the form (1) containing a smooth nonlinearity f, the variational equation (to be solved for $\tilde{u}$) will always have variable coefficients.

The  remainder  of  this  paper  (Sections  2,  3,  4  and  5)  is 
devoted  to  applying  the  perturbation  technique  to  the  particular 
BVP  consisting  of  equations  (2)  and  (3)  below. 

2. A PARTICULAR FREE BOUNDARY PROBLEM. Let us denote by $P(\varepsilon)$ the following two-dimensional BVP for a nonlinear PDE in the vertical strip $S = \{(x,y): 0 < x < 1,\ -\infty < y < \infty\}$:

$$P(\varepsilon): \qquad \Delta u + f(u) = 0 \quad \text{in } S \tag{2}$$

$$u(0,y) = \varepsilon h(y), \qquad u_x(1,y) = 0 \qquad (-\infty < y < \infty) \tag{3}$$

Here $\Delta = \partial^2/\partial x^2 + \partial^2/\partial y^2$, h is a given continuous, bounded, periodic function, $\varepsilon \ge 0$ is a (small) constant, and f is a step-function with given threshold value $\mu \ge 0$:

$$f(u) = \begin{cases} 0, & u < \mu \\ 1, & u \ge \mu \end{cases}$$

(We could also write $f(u) = H(u - \mu)$, where H is the Heaviside unit function.)

Solutions of $P(\varepsilon)$ will be required to be $C^1$, periodic and (therefore) bounded in the closure of S. In particular, then, u and its normal derivative $\partial u/\partial n$ must be continuous across any free boundary (not known a priori), where $u = \mu$.


Suppose that |h| is bounded by 1, also that $0 \le \varepsilon < \mu$. Then by continuity, $u < \mu$ at points of S close to the left boundary x = 0. If $u < \mu$ throughout S, f = 0 in S and $P(\varepsilon)$ is a (linear) BVP for Laplace's equation. For solutions satisfying $u \ge \mu$ somewhere in S, however, $P(\varepsilon)$ is not linear. Analysis of the "reduced" problem P(0) (see Section 3) suggests that for small positive $\mu$, $P(\varepsilon)$ possesses solutions of both the linear and nonlinear problems.

The nonlinear case of $P(\varepsilon)$ is of interest here. We seek an approximate solution in the form $u = u_0 + \varepsilon\tilde{u}$, where $u_0$ is a (known) solution of the (one-dimensional) nonlinear problem P(0); likewise free boundaries in $P(\varepsilon)$ are assumed to be perturbations of the free boundaries in P(0). In Section 3 we obtain the solution(s) of P(0) for all non-negative values of $\mu$; in particular, it is shown that when $0 \le \mu \le 1/4$ there are non-trivial solutions.

In Section 4 we perturb the PDE (2) about $u_0$ and obtain the (linear) variational equation for $\tilde{u}$. In Section 5, taking $h(y) = \cos y$, we solve the BVP for $\tilde{u}$ by separation of variables. The free boundary is determined by substituting for x, in the interface condition

$$u(x,y) = u_0(x) + \varepsilon\tilde{u}(x,y) = \mu,$$

the assumed form $x = \bar{x} + \varepsilon g(y)$, where $u_0(\bar{x}) = \mu$ (that is, $x = \bar{x}$ is the interface for the reduced problem) and g is the periodic function which we must find. Also for more general boundary data (namely, h a trigonometric polynomial) the variables can be separated; in this case we merely sketch the procedure.

3. ANALYSIS OF P(0). When $\varepsilon = 0$, the boundary conditions (3) are both independent of y; thus P(0) reduces to the following one-dimensional problem:

$$P(0): \qquad u'' + f(u) = 0 \quad \text{in } I:\ 0 < x < 1, \qquad u(0) = 0, \quad u'(1) = 0 \tag{4}$$

where $' = d/dx$. We shall find all solutions for $\mu \ge 0$.

Note first that all solutions are non-negative, because $u' \ge 0$ on I. The latter follows from the facts that $u'' = -f(u) \le 0$ (wherever u'' exists) and u'(1) = 0.

When $\mu = 0$, (4) takes the form u'' = -1 on I. Then P(0) has the unique solution $u(x) = x - x^2/2$.

For any fixed $\mu > 0$, P(0) has the trivial solution $u(x) \equiv 0$. In order for a non-trivial solution to exist, it is necessary that there be a smallest value $\bar{x}$ in I such that $u(\bar{x}) = \mu$. Then $u(x) < \mu$ for $0 \le x < \bar{x}$ and (since $u'(x) \ge 0$) $u(x) \ge \mu$ for $\bar{x} \le x \le 1$. Therefore a non-trivial solution of P(0) must satisfy

$$u'' = 0 \quad \text{for } 0 < x < \bar{x}, \qquad u'' + 1 = 0 \quad \text{for } \bar{x} < x < 1 \tag{5}$$

plus the boundary and continuity conditions

$$u(0) = 0, \quad u'(1) = 0, \quad u(\bar{x}+) = u(\bar{x}-) = \mu, \quad u'(\bar{x}+) = u'(\bar{x}-) \tag{6}$$

for some $\bar{x}$ in I.

Solving the differential equations (5) on their respective intervals, then subjecting them to conditions (6), we find that for $\mu > 0$,

$$u_0(x) = \begin{cases} (1 - \bar{x})\,x & (0 \le x \le \bar{x}) \\ x - \tfrac{1}{2}(x^2 + \bar{x}^2) & (\bar{x} < x \le 1) \end{cases} \tag{7}$$

is a solution of P(0) provided $\bar{x}$ $(0 < \bar{x} < 1)$ satisfies

$$\bar{x}(1 - \bar{x}) = \mu. \tag{8}$$

This quadratic equation has distinct real roots $\bar{x}$ in I when $0 < \mu < 1/4$, the double root $\bar{x} = 1/2$ when $\mu = 1/4$, and complex roots when $\mu > 1/4$.

We can now describe the numbers and types of solutions of P(0) for all non-negative values of $\mu$:

$\mu = 0$: Unique solution: $u(x) = x - x^2/2$.

$0 < \mu < 1/4$: Three solutions: the trivial one plus two solutions given by (7), each corresponding to a different root of (8).

$\mu = 1/4$: Two solutions: the trivial one plus one given by (7) when $\bar{x} = 1/2$.

$\mu > 1/4$: Unique solution: $u(x) \equiv 0$.
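The classification above can be checked numerically with a short Python sketch. It is only an illustration of equations (7) and (8); the function names `interface_roots`, `u0` and `count_solutions` are hypothetical, and the threshold is passed as `mu`.

```python
import math

def interface_roots(mu):
    """Real roots in (0, 1) of xbar*(1 - xbar) = mu, equation (8)."""
    disc = 1.0 - 4.0*mu
    if mu <= 0.0 or disc < 0.0:
        return []
    r = math.sqrt(disc)
    # A set removes the duplicate when the double root xbar = 1/2 occurs.
    return sorted({(1.0 - r)/2.0, (1.0 + r)/2.0})

def u0(x, xbar):
    """Non-trivial solution (7) of P(0) with interface at xbar."""
    if x <= xbar:
        return (1.0 - xbar)*x
    return x - 0.5*(x*x + xbar*xbar)

def count_solutions(mu):
    """Number of solutions of P(0) for mu >= 0.

    For mu > 0 this is the trivial solution plus one solution per root
    of (8); for mu = 0 it is the single solution x - x**2/2."""
    return 1 + len(interface_roots(mu))
```

For example, `interface_roots(0.16)` gives the two interfaces near 0.2 and 0.8, and `u0(xbar, xbar)` reproduces the threshold value at either root.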


4. THE PERTURBATION PROCEDURE. Henceforth our attention is restricted to values of $\mu \in (0, 1/4)$.

As seen in Section 3, for each such $\mu$, P(0) possesses two non-trivial solutions in addition to the trivial one. Focussing on the nonlinear case, we have reason to expect (see [4]) that there exists a solution of $P(\varepsilon)$ close to at least one of the non-trivial solutions $u_0(x)$ of P(0).

For fixed $\mu \in (0, 1/4)$, let $u_0(x)$ be the solution (7) of P(0) corresponding, say, to the smaller root of equation (8); thus, $0 < \bar{x} < 1/2$. (The formal calculation which follows is the same for either root.) We shall assume that the y-periodic solution of $P(\varepsilon)$ close to this $u_0(x)$ can be written, neglecting terms which are $O(\varepsilon^2)$,

$$u(x,y) \approx u_0(x) + \varepsilon\tilde{u}(x,y), \tag{9}$$

where $\tilde{u}$ is a function periodic in y and uniformly bounded in the closed strip.

Similarly, we assume that the solution (9) has a free boundary which may be represented

$$x = \bar{x} + \varepsilon g(y), \tag{10}$$

that is, as a perturbation of the "free boundary" $x = \bar{x}$ in $u_0(x)$.

Subtracting $\Delta u_0 + f(u_0) = 0$ from $\Delta u + f(u) = 0$ and noting that (formally)

$$f(u) = f(u_0 + \varepsilon\tilde{u}) \approx f(u_0) + f'(u_0)\cdot\varepsilon\tilde{u}$$

we obtain the variational equation

$$\Delta\tilde{u} + f'(u_0)\,\tilde{u} = 0,$$

or

$$\Delta\tilde{u} + \frac{\delta(x - \bar{x})}{u_0'(\bar{x})}\,\tilde{u} = 0, \tag{11}$$

where we have used the identities

$$f'(u_0(x)) = H'[u_0(x) - \mu] = \delta[u_0(x) - \mu] = \delta(x - \bar{x})/u_0'(\bar{x}).$$

From the boundary conditions in (3) and (4), since $\varepsilon\tilde{u} = u - u_0$, follow the boundary conditions on $\tilde{u}$:

$$\tilde{u}(0,y) = h(y), \qquad \tilde{u}_x(1,y) = 0 \qquad (-\infty < y < \infty) \tag{12}$$

In the next section two examples are considered in which h actually varies with y in a periodic fashion. First we can gain some confidence in the validity of the perturbation procedure from consideration of the simple example

$$h(y) = \varepsilon \qquad (0 < \varepsilon < \mu)$$

In this example $P(\varepsilon)$ is itself a one-dimensional problem; we are still interested in the nonlinear case. Without giving details we point out that if first one solves $P(\varepsilon)$ exactly (by an analysis similar to that of P(0) in the previous section), then seeks an approximate solution in the form $u = u_0 + \varepsilon\tilde{u}$ with interface $x_\varepsilon = \bar{x} + \varepsilon g$ (by solving the BVP (11)-(12)), one finds that the latter expressions agree with the exact representations for u and $x_\varepsilon$ through terms of first order in $\varepsilon$.

5. EXAMPLES. We give two examples in which the linear BVP (11)-(12) can be solved by separation of variables.

EXAMPLE 1: In $P(\varepsilon)$ let

$$h(y) = \cos y$$

Substitution in (11) and (12) of

$$\tilde{u}(x,y) = v(x) \cos y$$

yields the BVP

$$v'' - v + \frac{\delta(x - \bar{x})}{u_0'(\bar{x})}\,v = 0 \quad (0 < x < 1), \qquad v(0) = 1, \quad v'(1) = 0 \tag{13}$$

The differential equation in (13) implies a jump condition at $x = \bar{x}$. Suppose v(x) is a solution continuous on [0,1]. Integrating the equation from $\bar{x} - \eta$ to $\bar{x} + \eta$ ($\eta$ small and positive), then letting $\eta \to 0$, we find that the slope of v(x) undergoes a jump at $x = \bar{x}$:

$$v'(\bar{x}+) - v'(\bar{x}-) = -v(\bar{x})/\lambda, \tag{14}$$

where

$$\lambda = u_0'(\bar{x}) = 1 - \bar{x}.$$

Now solving $v'' - v = 0$ on each of the intervals $0 < x < \bar{x}$ and $\bar{x} < x < 1$ (so that we have four arbitrary constants), then imposing the boundary conditions from (13), the jump condition (14) and the continuity condition $v(\bar{x}+) = v(\bar{x}-)$, we obtain for BVP (13) the continuous solution

$$v(x) = \begin{cases} \cosh x + A \sinh x & (0 \le x < \bar{x}) \\ B \cosh(1 - x) & (\bar{x} \le x \le 1) \end{cases} \tag{15}$$

where

$$A = B\left[\tfrac{1}{\lambda} \cosh\bar{x}\,\cosh(1 - \bar{x}) - \sinh 1\right], \qquad B = \left[\cosh 1 - \tfrac{1}{\lambda} \sinh\bar{x}\,\cosh(1 - \bar{x})\right]^{-1} \tag{16}$$
We seek the free boundary, for the solution u(x,y) given approximately by (9), as a perturbation of $x = \bar{x}$, the free boundary for $u_0(x)$. In other words, it is assumed that $u = \mu$ along a curve

$$x = \bar{x} + \varepsilon g(y),$$

where g is a periodic function and terms of order $\varepsilon^2$ are neglected.

Substitution of $\bar{x} + \varepsilon g(y)$ for x in

$$u_0(x) + \varepsilon v(x) \cos y = \mu$$

gives

$$u_0(\bar{x} + \varepsilon g(y)) + \varepsilon v(\bar{x} + \varepsilon g(y)) \cos y = \mu,$$

$$u_0(\bar{x}) + u_0'(\bar{x}) \cdot \varepsilon g(y) + \varepsilon v(\bar{x}) \cos y + O(\varepsilon^2) = \mu,$$

$$\varepsilon\lambda g(y) + \varepsilon v(\bar{x}) \cos y \approx 0, \tag{18}$$

where we have used $u_0(\bar{x}) = \mu$, $u_0'(\bar{x}) = \lambda$, and the fact that while v is not differentiable on [0,1] it is Lipschitzian. Finally from (18)

$$g(y) = -\bigl(v(\bar{x})/\lambda\bigr) \cos y = -\frac{B}{\lambda} \cosh(1 - \bar{x}) \cos y. \tag{19}$$

It should be remarked that $u(x,y) = u_0(x) + \varepsilon v(x) \cos y$, where $u_0$ and v are given by (7) and (15) respectively, is not $C^1$ in S, as required. It is only when we adjust the (free) boundary between the left- and right-hand regions, by wiggling the interface, that we obtain an (approximate) solution.

To sum up, for given $\mu \in (0, 1/4)$ and $0 < \varepsilon < \mu$ we have derived, by a formal perturbation scheme, an approximate solution of $P(\varepsilon)$, which is $C^1$ and periodic in y, of the form

$$u(x,y) = \begin{cases} (1 - \bar{x})\,x + \varepsilon(\cosh x + A \sinh x) \cos y, & 0 \le x \le \bar{x} + \varepsilon g(y) \\ x - \tfrac{1}{2}(x^2 + \bar{x}^2) + \varepsilon B \cosh(1 - x) \cos y, & \bar{x} + \varepsilon g(y) \le x \le 1 \end{cases}$$

where $\bar{x}$ is the smaller root of (8), while A, B and g(y) are given by (16) and (19) respectively.

It may be shown, finally, that the requirement that $\partial u/\partial n$ be continuous across the interface is satisfied to within terms of order $\varepsilon^2$.
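As a numerical check on Example 1, the constants in (16), the function v of (15) and the perturbed interface obtained from (19) can be evaluated directly. The Python sketch below is illustrative only; the function names are hypothetical, `mu` and `eps` stand for the threshold and the small parameter, and the smaller root of (8) is assumed, as in the text.

```python
import math

def example1_constants(mu):
    """Return (xbar, lam, A, B) for Example 1; requires 0 < mu < 1/4."""
    xbar = (1.0 - math.sqrt(1.0 - 4.0*mu)) / 2.0   # smaller root of (8)
    lam = 1.0 - xbar                                # lam = u0'(xbar)
    B = 1.0 / (math.cosh(1.0) - math.sinh(xbar)*math.cosh(1.0 - xbar)/lam)
    A = B*(math.cosh(xbar)*math.cosh(1.0 - xbar)/lam - math.sinh(1.0))
    return xbar, lam, A, B

def v(x, mu):
    """Piecewise solution (15) of the BVP (13)."""
    xbar, lam, A, B = example1_constants(mu)
    if x < xbar:
        return math.cosh(x) + A*math.sinh(x)
    return B*math.cosh(1.0 - x)

def free_boundary(y, mu, eps):
    """First-order interface position xbar + eps*g(y), with g from (19)."""
    xbar, lam, A, B = example1_constants(mu)
    g = -(B/lam)*math.cosh(1.0 - xbar)*math.cos(y)
    return xbar + eps*g
```

With these definitions, continuity of v at the interface and the jump condition (14) can be confirmed to rounding error for any admissible `mu`, which is a quick way to catch sign errors when transcribing (16).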


EXAMPLE 2: In $P(\varepsilon)$ let

$$h(y) = a_0 + \sum_{n=1}^{N} (a_n \cos ny + b_n \sin ny)$$

where N is a positive integer. Because the treatment is similar to that of the previous example, we shall only touch on the points of difference.

Again we fix $\mu \in (0, 1/4)$, choose the root of (8) satisfying $0 < \bar{x} < 1/2$, and require $0 < \varepsilon < \mu$. To ensure $|h(y)| \le 1$, let

$$|a_0| + \sum_{n=1}^{N} (|a_n| + |b_n|) \le 1.$$

Again assuming the approximate solution of $P(\varepsilon)$ to have the form (9) and the free boundary to have the form (10), we are led to the BVP (11)-(12). Instead of $\tilde{u}(x,y) = v(x) \cos y$, however, we now set

$$\tilde{u}(x,y) = a_0 v_0(x) + \sum_{n=1}^{N} \bigl(a_n v_n(x) \cos ny + b_n w_n(x) \sin ny\bigr).$$

Substituting this for $\tilde{u}$ in (11) and (12), then separating variables, we find that for n = 1,...,N, both $v_n$ and $w_n$ must be solutions of the BVP

$$v'' - n^2 v + \frac{\delta(x - \bar{x})}{u_0'(\bar{x})}\,v = 0 \quad (0 < x < 1), \qquad v(0) = 1, \quad v'(1) = 0$$

while $v_0$ must be a solution for n = 0.

Proceeding as in the previous example, one can obtain the expressions for $u_0(x) + \varepsilon\tilde{u}(x,y)$ to the left and right of the free boundary, also the approximate representation $x = \bar{x} + \varepsilon g(y)$ for the free boundary itself. But we shall omit the details.


DETERMINATION OF PROPAGATION CONSTANTS IN SCATTERING
FROM DIELECTRIC-COATED WIRES

Leon Kotin

Communications/Automatic Data Processing Laboratory
U. S. Army Electronics Command, Fort Monmouth, New Jersey 07703

ABSTRACT

We determine the propagation constants which describe mathematically the behavior of electromagnetic waves reflected from dielectric-coated wires. These are obtained from the roots of two characteristic equations of transcendental type. The roots are the propagation constants of the creeping waves generated by diffraction of plane waves polarized tangentially and normally to the wire axis, respectively. Their real and imaginary parts give the phase and attenuation of the creeping waves around the circumference of the wire.

1. Introduction. The effectiveness of many communication systems can be seriously diminished by reflections of electromagnetic signals from obstacles, both natural and man-made. Dielectric-coated wires constitute a man-made obstacle which appears with increasing frequency in military situations. Nor is this obstacle restricted to communications effects. The U. S. A. Board of Aviation Accident Research recently cited the following statistics for a four-year period of daylight operations under peacetime conditions. There were 1.06 accidents involving low-flying aircraft and electric wires. These resulted in 78 fatalities, 56 injuries, and 6.6 million dollars damage.

In this paper we obtain the propagation constants which describe mathematically the behavior of waves reflected from dielectric-coated wires.

In an attempt to determine reasonably rapid convergent series representations for the scatter field and radar response of dielectric-coated wires, F. Schwering and C. De Santis [6] obtained two complicated characteristic equations of transcendental type. The roots of these equations are the propagation constants of the creeping waves generated by diffraction of plane waves polarized tangentially and normally to the wire axis, respectively. Their real and imaginary parts give the phase and attenuation of the creeping waves around the circumference of the wire.

In the case of tangential polarization of the incident wave, the propagation constants $\nu$ are determined from the characteristic equation [6]

$$ka\,H_\nu'(ka)\,W_\nu(a,b) - k_d a\,H_\nu(ka)\,W_\nu'(a,b) = 0 \tag{1}$$

where

$$W_\nu(a,b) \equiv J_\nu(k_d a)\,Y_\nu(k_d b) - J_\nu(k_d b)\,Y_\nu(k_d a) \tag{2}$$

and

$$W_\nu'(a,b) \equiv \frac{\partial W_\nu(a,b)}{\partial (k_d a)} \tag{3}$$

Here $J_\nu$ and $Y_\nu$ are the Bessel and Neumann functions, $H_\nu$ the Hankel function of the second kind, k the free-space wave number, $k_d$ the wave number of the dielectric material, and a and b the outer and inner radii of the dielectric coat (see Fig. 1).

A more complicated expression appears in $U^{\perp}_\nu = 0$, the
characteristic equation in the case of normal polarization. This will be
treated analogously later.


Fig.  1.  Wire  with  Dielectric  Coat. 


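Introducing

$$ x = ka, \quad y = k_d a, \quad z = k_d b, \quad H_\nu(x) \equiv H^{(2)}_\nu(x) $$

for simplicity into (1) - (3), we shall obtain $\nu$ as the zeros of the
function $U^{\parallel}_\nu \equiv U_\nu$:

$$ U_\nu = x H'_\nu(x)\,W_\nu(y,z) - y H_\nu(x)\,W'_\nu(y,z) \tag{4} $$

where

$$ W_\nu(y,z) \equiv J_\nu(y)\,Y_\nu(z) - J_\nu(z)\,Y_\nu(y) \tag{5} $$

and

$$ W'_\nu(y,z) \equiv \frac{\partial W_\nu}{\partial y}. \tag{6} $$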


Using function-theoretical and analytical techniques, we shall
obtain first some general qualitative properties of $\nu$ and then
analytical approximations to the large zeros. Finally we shall give
numerically the physically significant smallest zeros for several
representative values of x, y and z.


2. The symmetry of the zeros. First we show that the function
$e^{-i\nu\pi/2} U_\nu$ is an even function of $\nu$.

Theorem 1. If $U_\nu$ is defined by (4), then

$$ e^{-i\nu\pi/2}\,U_\nu = e^{i\nu\pi/2}\,U_{-\nu}. $$

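Proof. We have [5]

$$ Y_\nu(t) = \frac{J_\nu(t)\cos\nu\pi - J_{-\nu}(t)}{\sin\nu\pi} \tag{7} $$

whenever $\nu$ is not an integer. (For integral n, $Y_n(t) = \lim_{\nu \to n} Y_\nu(t)$.
In this case, the following argument can be modified by taking limits.)

Then

$$ W_\nu(y,z) = \frac{1}{\sin\nu\pi}\Big[ J_\nu(y)\big(\cos\nu\pi\,J_\nu(z) - J_{-\nu}(z)\big) - J_\nu(z)\big(\cos\nu\pi\,J_\nu(y) - J_{-\nu}(y)\big) \Big] $$

$$ \phantom{W_\nu(y,z)} = \frac{1}{\sin\nu\pi}\Big[ J_\nu(z)\,J_{-\nu}(y) - J_\nu(y)\,J_{-\nu}(z) \Big], $$

which is unchanged when $\nu$ is replaced by $-\nu$. Thus

$$ W_\nu(y,z) = W_{-\nu}(y,z) \tag{8} $$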

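Since [5, p. 67] $H_{-\nu} = e^{-i\nu\pi} H_\nu$, we have from (4)

$$ U_{-\nu} = e^{-i\nu\pi}\big( x H'_\nu(x)\,W_{-\nu}(y,z) - y H_\nu(x)\,W'_{-\nu}(y,z) \big), \tag{9} $$

whence from (8)

$$ U_{-\nu} = e^{-i\nu\pi}\,U_\nu. \tag{10} $$

This immediately gives us the desired result:

$$ e^{-i\nu\pi/2}\,U_\nu = e^{i\nu\pi/2}\,U_{-\nu}. \tag{11} $$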

An obvious consequence is that the zeros are symmetric with
respect to the origin in the complex $\nu$-plane.

Corollary. If $\nu$ is a zero of $U_\nu$, so is $-\nu$.

It is interesting to note that this simple theorem yields results
which are far less obvious than the above corollary. These results
refer to the strict complexity of the zeros and the infinitude of zeros,
and appear in the following sections.
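
The symmetry of the cross product $W_\nu(y,z) = J_\nu(y)Y_\nu(z) - J_\nu(z)Y_\nu(y)$ under $\nu \to -\nu$, on which Theorem 1 rests, can be spot-checked numerically. The following is only a minimal sketch, not the routine used for this paper: it assumes real, non-integer order and positive real arguments, uses the ascending power series for $J_\nu$, and takes $Y_\nu$ from the connection formula for the Neumann function. The function names and the sample order 0.3 are illustrative choices.

```python
import math

def bessel_j(order, t, terms=60):
    """Ascending power series for J_order(t), t > 0; order must not be a negative integer."""
    total = 0.0
    for k in range(terms):
        total += ((-1) ** k * (t / 2.0) ** (2 * k + order)
                  / (math.factorial(k) * math.gamma(k + order + 1.0)))
    return total

def bessel_y(order, t):
    """Neumann function from J_order and J_(-order); only valid for non-integer order."""
    angle = order * math.pi
    return (bessel_j(order, t) * math.cos(angle) - bessel_j(-order, t)) / math.sin(angle)

def cross_product_w(order, y, z):
    """W_order(y, z) = J(y) Y(z) - J(z) Y(y)."""
    return (bessel_j(order, y) * bessel_y(order, z)
            - bessel_j(order, z) * bessel_y(order, y))

# Illustrative parameters: x = 1 gives y = 1.5 and z = 1.35; the order 0.3 is arbitrary.
w_plus = cross_product_w(0.3, 1.5, 1.35)
w_minus = cross_product_w(-0.3, 1.5, 1.35)
print(w_plus, w_minus)  # the two values should agree to rounding error
```

The series is adequate only for moderate arguments; a production calculation for complex order would need a proper Bessel library.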
3. The strict complexity of the zeros. In the rest of this
paper we shall denote the real and imaginary parts of $\nu$ by $\alpha$ and $\beta$,
respectively, i.e.,

$$ \nu = \alpha + i\beta. $$

We now show that neither the real nor imaginary part of any zero of $U_\nu$
is zero.

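Theorem 2. If $U_\nu = 0$, then $\alpha\beta \neq 0$.

Proof. Taking complex conjugates of both sides of (10),

$$ \overline{U_{-\nu}} = e^{i\bar\nu\pi}\,\overline{U_\nu}. \tag{12} $$

Since [5] for real argument

$$ \overline{J_\nu} = J_{\bar\nu}, \quad \overline{Y_\nu} = Y_{\bar\nu}, \quad \overline{W_\nu} = W_{\bar\nu}, \tag{13} $$

where we dropped the dependence on x, y and z, we have from (12) and (4)

$$ x H^{(1)\prime}_{-\bar\nu} W_{-\bar\nu} - y H^{(1)}_{-\bar\nu} W'_{-\bar\nu} = e^{i\bar\nu\pi}\big( x H^{(1)\prime}_{\bar\nu} W_{\bar\nu} - y H^{(1)}_{\bar\nu} W'_{\bar\nu} \big). \tag{14} $$

If $U_\nu = 0$ with $\beta = \operatorname{Im}\nu = 0$, then $\bar\nu = \nu$ and we have the simultaneous
homogeneous equations

$$ U_\nu = x H'_\nu W_\nu - y H_\nu W'_\nu = 0, \qquad x H^{(1)\prime}_\nu W_\nu - y H^{(1)}_\nu W'_\nu = 0, \tag{15} $$

the latter coming from the right-hand side of (14). The determinant
of coefficients of $xW_\nu$ and $yW'_\nu$ must then vanish:

$$ \Delta \equiv H^{(1)}_\nu H'_\nu - H^{(1)\prime}_\nu H_\nu = 0. \tag{16} $$

This, however, is impossible, since $H^{(1)}_\nu$ and $H_\nu \equiv H^{(2)}_\nu$ are linearly
independent solutions of Bessel's equation. Indeed, $\Delta = -4i/\pi x \neq 0$
[5, p. 68]. Thus $\beta \neq 0$ and the zeros are not real.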
Applying a similar argument assuming $\alpha = 0$, whence $-\bar\nu = \nu$, and
taking the left-hand side of (14) gives another contradiction. This
shows that the zeros cannot be pure imaginary either, completing the
proof of the theorem.
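
The non-vanishing determinant used in the proof is the Wronskian of the two Hankel functions, $H^{(1)}_\nu H^{(2)\prime}_\nu - H^{(1)\prime}_\nu H^{(2)}_\nu = -4i/\pi x$. A minimal numerical sketch of that identity follows, assuming real non-integer order and a positive real argument; the series-based helpers and the sample values (order 0.3, x = 1) are illustrative assumptions, not part of the original analysis.

```python
import math

def bessel_j(order, t, terms=60):
    """Ascending power series for J_order(t), t > 0; order must not be a negative integer."""
    total = 0.0
    for k in range(terms):
        total += ((-1) ** k * (t / 2.0) ** (2 * k + order)
                  / (math.factorial(k) * math.gamma(k + order + 1.0)))
    return total

def bessel_y(order, t):
    """Neumann function from J_order and J_(-order); only valid for non-integer order."""
    angle = order * math.pi
    return (bessel_j(order, t) * math.cos(angle) - bessel_j(-order, t)) / math.sin(angle)

def derivative(func, order, t):
    """Derivative in t via the recurrence F'_v = (F_(v-1) - F_(v+1)) / 2."""
    return 0.5 * (func(order - 1.0, t) - func(order + 1.0, t))

def hankel_wronskian(order, x):
    """H1 * H2' - H1' * H2 with H1 = J + iY and H2 = J - iY."""
    j, y = bessel_j(order, x), bessel_y(order, x)
    dj, dy = derivative(bessel_j, order, x), derivative(bessel_y, order, x)
    h1, dh1 = complex(j, y), complex(dj, dy)
    h2, dh2 = complex(j, -y), complex(dj, -dy)
    return h1 * dh2 - dh1 * h2

delta = hankel_wronskian(0.3, 1.0)
print(delta, -4j / math.pi)  # both should be close to -1.2732j
```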

4. The infinitude of zeros. We know from physical considerations,
of course, that there exist roots of the characteristic equation. We
now prove that there are an infinite number of these roots. To this
end, we invoke some function-theoretical considerations, such as the
concept of order of growth $\omega(f)$ of an entire (or integral) function
$f(\nu)$ [1, p. 8; 7, p. 248], defined as the infimum of all exponents $\rho$
such that

$$ |f(\nu)| = O\big(e^{|\nu|^{\rho}}\big) \quad \text{as } |\nu| \to \infty. $$

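Using Poisson's formula [5, p. 79]

$$ J_\nu(y) = \frac{2\,(y/2)^{\nu}}{\sqrt{\pi}\,\Gamma(\nu + \tfrac12)} \int_0^{\pi/2} \cos(y\cos t)\,\sin^{2\nu} t\;dt, \tag{17} $$

we find easily that when $\alpha \geq 0$,

$$ |J_\nu(y)| \leq \frac{2\,|(y/2)^{\nu}|}{\sqrt{\pi}\,|\Gamma(\nu + \tfrac12)|} \int_0^{\pi/2} \sin^{2\alpha} t\;dt \leq \frac{\sqrt{\pi}\,|(y/2)^{\nu}|}{|\Gamma(\nu + \tfrac12)|}. \tag{18} $$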
Thus the order of the integral is at most 1 when $\alpha = \operatorname{Re}\nu \geq 0$.

Moreover the entire function $1/\Gamma(\nu)$ is of order 1 [7, p. 255]. Since
the order of the product (or sum) is no greater than that of the
greater factor (or term), it follows that the order of $J_\nu(y)$ is no
greater than unity when $\operatorname{Re}\nu \geq 0$.

To  eliminate  this  restriction  on  the  sign  of  a  =  Re  v,  we  use  the 
facts  that 


[4.  p.  229]  , 

H  =  J  +  iY 

V  V  V 

and 


$$ J_{-\nu} = J_\nu \cos\pi\nu - Y_\nu \sin\pi\nu $$


(19) 

(20) 

(21) 


From (19) and (20), we find that $\omega(Y_\nu) \leq 1$ for $\alpha \geq 0$. Then we
conclude from (21) and earlier results that $\omega(J_\nu) \leq 1$, with no restriction
on $\alpha$. Moreover, since $Y_\nu$, $H_\nu$ and their derivatives can be expressed
[5, § 3.1] in terms of trigonometric functions of $\nu\pi$ and the Bessel function J with indices
$\pm\nu$, $\pm\nu + 1$ and $\pm\nu - 1$, it follows finally that

Lemma. The order of growth of $U_\nu$ is less than or equal to 1.

Now let $\nu^2 = \lambda$. Then since $e^{-i\nu\pi/2}U_\nu$ is an entire even function
of $\nu$ of order $\leq 1$, the function $f(\lambda) \equiv e^{-i\nu\pi/2}U_\nu$ is an entire function
of $\lambda$ whose order is $\leq \tfrac12$. Consequently [7, pp. 250, 252], $f(\lambda)$ has an
infinite number of zeros $\lambda_k$. From the definition of $f(\lambda)$, we conclude

Theorem 3. $U_\nu$ has an infinite number of zeros.
Moreover [7, p. 250] we obtain the following product
representation:

$$ f(\lambda) = f(0) \prod_{k=1}^{\infty} \left( 1 - \frac{\lambda}{\lambda_k} \right). \tag{22} $$

Expressed in terms of $U_\nu$, (22) becomes

$$ U_\nu = e^{i\nu\pi/2}\,U_0 \prod_{k=1}^{\infty} \left( 1 - \frac{\nu^2}{\nu_k^2} \right) \tag{23} $$

where the $\nu_k$ are the zeros of $U_\nu$. Note that from Theorem 2,
$U_0 \neq 0$, as is required for this product representation to be valid.

5. The large zeros of $U_\nu$. Since there are an infinite number
of zeros of the entire function $U_\nu$, the zeros are arbitrarily
large. To approximate these when $|\nu| \gg \max\{x,y,z\}$, we first
express $U_\nu$ in terms of J alone using standard identities [5, § 3.1],
obtaining

2i  sin^  vir  .-((x)  -  +  ^(x)^  +  +  i(^)  "  "'-v  -  l^^^j 

X  [j.„(x)(jv  -  I'x)  -  Jv  +  1<X>)  +  -  J-v  - 


(24) 


(y) 
Then using the asymptotic behavior of $J_\nu(t)$:

$$ J_\nu(t) \sim \frac{1}{\sqrt{2\pi\nu}} \left( \frac{et}{2\nu} \right)^{\nu} \tag{25} $$

for large $\nu$, and dropping the lower-order terms, we obtain from (24)


$$ \nu \ln(\mp 2y\nu/exz) \sim (n \mp \tfrac12)\pi i \quad \text{as } \pm\beta > 0, \tag{26} $$

implicitly giving approximately the n-th zero for large n.

Since the zeros are symmetric in the $\nu$-plane, we can select
$\beta > 0$ and thus drop the lower signs in (26). Rewriting (26) as

$$ \nu \sim (n - \tfrac12)\pi i \,/\, \ln(-2y\nu/exz), \tag{27} $$

iterating, and neglecting the lower-order terms, we obtain the
following explicit approximation to the large zeros.

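Theorem 4. The large zeros of $U_\nu$ in the upper half-plane
are given by

$$ \nu_n \sim \frac{ -\tfrac{\pi^2}{2}(n - \tfrac12) + (n - \tfrac12)\pi i \,\ln\big((2n - \tfrac12)\pi y/exz\big) }{ \big[ \ln\big((2n - \tfrac12)\pi y/exz\big) \big]^2 } \tag{28} $$

for sufficiently large integers n.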
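
The iteration that leads from the implicit relation to the explicit formula can be sketched numerically. The code below is illustrative only: it assumes the upper-sign reading $\nu \ln(-2y\nu/exz) \sim (n - \tfrac12)\pi i$ of the implicit relation and the principal branch of the logarithm, and it evaluates the explicit expression with the logarithm argument $(2n - \tfrac12)\pi y/exz$ as printed. The function names and the sample values n = 50, x = 1, y = 1.5, z = 1.35 are assumptions chosen for the example.

```python
import cmath
import math

def large_zero_iterated(n, x, y, z, iterations=50):
    """Fixed-point iteration nu <- (n - 1/2) pi i / ln(-2 y nu / (e x z))."""
    target = (n - 0.5) * math.pi * 1j
    nu = target  # crude starting guess on the positive imaginary axis
    for _ in range(iterations):
        nu = target / cmath.log(-2.0 * y * nu / (math.e * x * z))
    return nu

def large_zero_explicit(n, x, y, z):
    """Explicit first-iterate approximation with lower-order terms dropped."""
    log_term = math.log((2 * n - 0.5) * math.pi * y / (math.e * x * z))
    numerator = -(n - 0.5) * math.pi ** 2 / 2.0 + (n - 0.5) * math.pi * 1j * log_term
    return numerator / log_term ** 2

nu_iter = large_zero_iterated(50, 1.0, 1.5, 1.35)
nu_expl = large_zero_explicit(50, 1.0, 1.5, 1.35)
print(nu_iter, nu_expl)
```

Both values fall in the second quadrant. Because the neglected terms are smaller only by powers of a logarithm, the explicit formula approaches the iterated value slowly as n grows.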

As a consequence,

$$ \nu_n \sim \frac{n\pi i}{\ln n} \quad \text{as } n \to \infty \tag{29} $$

since the real part approaches infinity more slowly than the
imaginary part. Furthermore, it can easily be shown from (28)
that the distance between consecutive zeros approaches zero.

We remark that this behavior, indeed the asymptotic representation
(28), is very similar to that of $H^{(2)}_\nu(x)$, which arises in
the theory of diffraction of electromagnetic waves by a perfectly
conducting sphere (cf. [2], [3], [4]).

6. The case of normal polarization. If the incident wave
is polarized normally to the axis of the dielectric-coated wire,
the characteristic equation is

$$ U^{\perp}_\nu \equiv y H'_\nu(x)\big[ J_\nu(y)Y'_\nu(z) - J'_\nu(z)Y_\nu(y) \big] - x H_\nu(x)\big[ J'_\nu(y)Y'_\nu(z) - J'_\nu(z)Y'_\nu(y) \big] = 0. \tag{30} $$

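Since the treatment of this case is identical to the previously
discussed case of parallel polarization, it suffices merely to
state the corresponding results.

Theorem 5. $e^{-i\nu\pi/2}\,U^{\perp}_\nu = e^{i\nu\pi/2}\,U^{\perp}_{-\nu}$.

Corollary. If $\nu$ is a zero of $U^{\perp}_\nu$, so is $-\nu$.

Theorem 6. If $U^{\perp}_\nu = 0$, then $\operatorname{Re}\nu \,\operatorname{Im}\nu \neq 0$.

Theorem 7. $U^{\perp}_\nu$ has an infinite number of zeros.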


7. The smallest zeros. Following are a table and curves (Fig. 2)
of the smallest zeros of $U^{\parallel}_\nu$ and $U^{\perp}_\nu$ in the second quadrant of the
complex $\nu$-plane for each of several representative values of the
parameters x, y, z. These values are x = 0.5(0.5)5, with
y = 1.5x and z = 0.9y. We recall that $x = ka$, $y = k_d a$, and
$z = k_d b$ where $k$ is the free-space wave number, $k_d$ the wave number
of the dielectric coat, and a and b the radii of the coat. The
coefficient $1.5 = k_d/k$ is the refractive index of polyethylene
and the coefficient 0.9 = b/a is the ratio of the two radii.
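
The notation x = 0.5(0.5)5 means x runs from 0.5 to 5 in steps of 0.5, so ten parameter triples are involved. A small sketch that enumerates them is given below; the constant and function names are illustrative and not taken from the original computation.

```python
REFRACTIVE_INDEX = 1.5  # k_d / k, the value quoted for polyethylene
RADIUS_RATIO = 0.9      # b / a, inner radius over outer radius

def parameter_grid():
    """Return the (x, y, z) triples for x = 0.5, 1.0, ..., 5.0."""
    grid = []
    for step in range(1, 11):
        x = 0.5 * step            # x = k a
        y = REFRACTIVE_INDEX * x  # y = k_d a
        z = RADIUS_RATIO * y      # z = k_d b
        grid.append((x, y, z))
    return grid

grid = parameter_grid()
for x, y, z in grid:
    print(f"x = {x:4.1f}   y = {y:5.2f}   z = {z:6.3f}")
```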
The zeros were obtained by J. f lerder of ECOM's Math.
Support Division using a Burroughs B-5700 and the Bessel routine
provided by M. Goldstein of New York University. The following
curves were obtained from the above data by C. De Santis of the
Communications Research Tech. Area.
INTRODUCTION. This is a review of some classical problems in laminar
flame theory¹ that essentially assumes no knowledge of combustion by the reader.


Laminar flame theory is a branch of fluid mechanics -- essentially the motions
of the gases in a flame are governed by the compressible Navier-Stokes equations --
but there are of course some crucial features which are not normally found in classical
fluid mechanics. For one thing one is dealing with a mixture of different gases and it
is necessary to say something about changes in each of the components of the mixture.
Secondly, and most important, there are chemical reactions so that there is a source
or sink term in the mass conservation equation for each component. Moreover, heat
is released by the chemical reactions so that there is a source term in the overall
energy equation. These chemical reactions are extremely sensitive to temperature --
they usually won't take place at all if the temperature is too low (which is fortunate) --
and an essential feature of combustion that helps distinguish it from other branches of
aerothermochemistry is that the high temperatures necessary to sustain the reactions
are generated by the heat released by the reactions themselves. Provided there is an
adequate supply of fuel and oxygen, combustion is a self-sustaining process.

There are two different approaches to the theory of combustion that one can take.
One is to insist on being as realistic as possible and retain in the formulation of the
problem all the complexities that might play a role in practice. This of course leads
to equations of remarkable complexity which can only be solved numerically. Such an
approach has its advocates (and is necessary if detailed quantitative results are needed)
but a more fruitful approach, given the present state of combustion science, is to strip
each problem down to its fundamentals and write down model equations that are clearly
inappropriate in reality but nevertheless contain the physical features which are the
essence of the problem. The hope is that the equations are simple enough to solve
analytically, or, if recourse to a computer is still necessary, simple enough so that
useful information can be extracted from the numbers. Quantitative accuracy is
sacrificed for qualitative understanding.

Actually there is a third approach to studying combustion problems that has been
quite popular but which should be avoided if at all possible. One starts by writing down
sensible model equations but then constructs what might be called model solutions.
That is, solutions are constructed using ad hoc irrational approximations and as a
consequence one can never be sure of the significance of the end results. It isn't clear
whether the features of the solution are creatures of the original model or of the
irrational approximations. This makes systematic development of the subject difficult and
has led to spurious results in the past.


¹This is a more or less verbatim transcript of a review that was specifically prepared
for oral presentation, so that the reader is asked to forgive the colloquial style. The
footnotes were not part of the original presentation but have been added for the sake of
clarity.


Of course it is clear why such an unsatisfactory approach has been popular --
for many years no rational systematic method of solving the various model equations
was known (although the literature is replete with brilliant ad hoc analyses). But in
recent years that has changed, and it would probably be fair to say that there
has been a revolution in combustion theory. At the heart of this revolution was the
realization that combustion theory has its own unique asymptotics which can be exploited
using singular perturbation theory. In particular, a combination of Damkohler Number
asymptotics and activation energy asymptotics, where appropriate, can often lead to the
solution of model equations that were for many years thought too difficult to solve.

What I want to do today is briefly describe the nature of these asymptotic
methods, concentrating particularly on activation energy asymptotics; describe the
mathematical details of a particularly simple application of activation energy
asymptotics; and then describe a perturbation procedure that generates nonlinear solutions
for a variety of problems, including a certain class of unsteady problems. In no sense
am I going to attempt an exhaustive review.

Let  us  start  by  looking  at  a  specific  problem. 


2.  QUASI-STEADY  FUEL  DROP  BURNING 


Figure 1 represents an idealized model of a burning fuel drop. The situation
is assumed steady and spherically symmetric, conditions never realized in practice,
which emphasizes that we are examining a highly idealized model.

The ball of fuel, in liquid form, is hot because of the presence of the flame
and as a consequence it evaporates, mixes by diffusion with the surrounding atmosphere
of oxygen, and then this mixture burns within the flame. Appropriate model equations
are²


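$$ \mathcal{L}(Y_O) \equiv \frac{M}{r^2}\frac{dY_O}{dr} - \frac{1}{r^2}\frac{d}{dr}\left( r^2 \frac{dY_O}{dr} \right) = -D_1\,Y_O\,Y_F\,T^{\alpha} \exp\left( -\frac{E}{RT} \right) \equiv -\omega $$

$$ \mathcal{L}(Y_F) = -\omega $$

$$ \mathcal{L}(T) = Q\omega. $$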

These equations are based on the simple chemical kinetic scheme

[Fuel] + [Oxygen] → [Product].

The kinetics of a real flame are much more complicated than this but nevertheless the
simple model preserves three essential features -- oxygen is consumed, fuel is
consumed, and heat is generated.

Looking at the equation for $Y_O$, the mass fraction of oxygen, we see that there
are three terms. The first term is a mass transport term (there is a radial flux of
fuel and therefore a mass-averaged radial velocity) and M is a measure of the flux
of fuel leaving the surface. It can be regarded as the fundamental unknown of the
problem.

The  second  term  is  a  diffusion  term. 

The third term, the chemical reaction term, simply indicates that the amount
of oxygen consumed depends on how much oxygen is present, how much fuel is present,
and the temperature T. The most important part of the temperature dependence is
the exponential factor -- R is the gas constant and E is a constant known as the
activation energy. E tends to be rather large so that the reaction rate is very sensitive
to changes in the temperature.
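
The sensitivity can be made concrete with a rough calculation. The sketch below compares the exponential factor exp(-E/RT) at two temperatures; the numbers (E/R of 20,000 K and 2,000 K, temperatures of 1500 K and 1650 K) are hypothetical values chosen only for illustration and do not come from the talk.

```python
import math

def arrhenius_ratio(e_over_r, t_low, t_high):
    """Ratio exp(-E/(R t_high)) / exp(-E/(R t_low)) of the exponential factors."""
    return math.exp(e_over_r * (1.0 / t_low - 1.0 / t_high))

# A 10 percent rise in temperature, with a large and a small activation energy.
large_e = arrhenius_ratio(20000.0, 1500.0, 1650.0)
small_e = arrhenius_ratio(2000.0, 1500.0, 1650.0)
print(large_e, small_e)
```

With the large activation energy the rate more than triples for a 10 percent temperature rise, while with the small one it changes by only about 13 percent.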

$D_1$ is a parameter that depends on a number of things including the pressure
(which is uniform) and is known as the Damkohler Number.

The equation for $Y_F$ is identical to that for $Y_O$, a consequence of assuming
equal diffusion coefficients. The energy equation (which is an equation for the
temperature since the thermal energy is much larger than the kinetic energy) is very similar
(the Lewis number equals one) but the reaction term appears with a positive sign since
heat is generated by the reaction, and the amount of heat generated is characterized
by the parameter Q.


²Kassoy, D. R. & Williams, F. A. Physics of Fluids, 11, 1343 (1968).
There are appropriate boundary conditions (which have not been written down),
4 at the surface and 3 at infinity, making a total of 7. Since the system is a sixth
order one, these conditions are sufficient to determine the three field variables and
M, which is a measure of the burning rate.

There are many different ways of characterizing the solution of this problem
and one way is to plot the variation of M with $D_1$.


Fig.  2.  Burning  Response  for  a  Fuel  Drop 


Figure 2 is typical of the kind of response one gets -- an S shaped curve,
and at the risk of oversimplification the turning points are labelled as the ignition
point and the extinction point. The reason for this is that if the response is on the
lower branch of the curve, where the burning is weak, and $D_1$ is increased (by
increasing the pressure, for example) then the response moves to the right until the
ignition point is reached whereupon any further increase in $D_1$ causes a jump to the
top branch where the burning is strong. A subsequent decrease in $D_1$ moves the
response to the left along the strong burning branch until the extinction point is reached
where the response drops back to the weak burning branch.


The oversimplification stems from the possibility that the response is forced off of
one of the branches by instability before the turning point is reached. This happens
in chemical reactor theory where similar S-shaped responses occur (Cohen, D. S.
& Poore, A. B. SIAM J. Appl. Math., 27, 416 (1974)).
Consider now the ends of the curve. At the left $D_1$ vanishes, whence $\omega$
vanishes and the equations reduce to linear equations which can be easily solved. This
so called frozen limit is of little interest since there is no combustion.

The right hand end ($D_1 \to \infty$) is much more important since typical flames
encountered in everyday life often have very large Damkohler numbers. The limit (called
the equilibrium limit) is a singular one in which the coefficient of the highest derivative
vanishes, and so thin layers (boundary layers or interior layers) can occur. Outside
of these layers it is apparent, since $\omega$ must be finite, that as $D_1 \to \infty$,

$$ Y_O Y_F \to 0 $$

and so $Y_O$ and/or $Y_F$ must vanish. $\omega$ is then the product of something that goes to
infinity times something that goes to zero and it is clear from the equations (the
equation for $Y_O$ when $Y_O$ vanishes) that this product vanishes in the limit. In this sense
there are similarities between the equilibrium limit and the frozen limit, but the
possibility of thin layers in the former case is a crucial difference.

Important though Damkohler Number asymptotics may be, it obviously cannot
tell us anything about ignition or extinction, so that if we wish to bridge the gap between
$D_1 = 0$ and $D_1 \to \infty$ a different approach is necessary. Activation energy asymptotics
is an appropriate tool. More precisely we consider the solution of the equations when

$$ E/RT_{ref} \to \infty $$

where $T_{ref}$ is some reference temperature. This is a realistic limit in many
combustion situations, it can be used to solve many important combustion problems,
and it is mathematically interesting because the large parameter appears in an
unconventional fashion, as the argument of an exponential.

One thing that is immediately clear is that we can not just put $E = \infty$ without
doing anything else since that just yields the frozen limit ($\omega = 0$). Bear in mind that
we want to determine how the response changes with $D_1$, and the above observation
implies that only when $D_1$ is very large can we get away from the frozen limit. What
we have to do is write

$$ D_1 = \exp(E/RT_*) $$

where $T_*$ is a temperature that characterizes the magnitude of $D_1$ so that

$$ \omega \propto \exp\left[ \frac{E}{R}\left( \frac{1}{T_*} - \frac{1}{T} \right) \right] $$

and then the behavior of $\omega$ in the limit $E \to \infty$ depends upon the relative magnitudes
of $T$ and $T_*$. There are three possibilities.

(i) In regions where $T > T_*$ the exponential goes to infinity in the limit, so that
$Y_O Y_F \to 0$, $\omega \to 0$, corresponding to equilibrium.


See  Buckmaster,  J.  D.  Combustion  and  Flame,  24,  79  (1975) 


Thin  layers  can  occur  in  such  regions  of  course. 


(ii) In regions where $T < T_*$ the exponential vanishes so that $\omega \to 0$, a frozen
situation.

(iii) Finally, in transition regions where $T \approx T_*$ (more precisely,
$(T - T_*)/T_* = O(RT_*/E)$) the exponential can be simplified slightly,

$$ \omega \propto \exp\left[ \frac{E}{RT_*}\,\frac{T - T_*}{T_*} \right], $$

but the important point is that $\omega$ does not vanish so that such a region is a reaction
zone. Reaction zones are often thin (but not necessarily so) in which case they are
called flame sheets.
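
The three cases can be illustrated by tracking the exponent $(E/R)(1/T_* - 1/T)$ as the activation energy grows. The sketch below works with the logarithm of the exponential factor to avoid overflow and uses the dimensionless group beta = E/(R T_*); the function name, the choice T_* = 1 and the sample temperatures are assumptions made only for illustration.

```python
def log_rate_factor(temperature, t_star, beta):
    """Exponent (E/R)(1/T* - 1/T), written as beta * (1 - T*/T) with beta = E/(R T*)."""
    return beta * (1.0 - t_star / temperature)

T_STAR = 1.0  # temperatures measured in units of T*
for beta in (10.0, 100.0, 1000.0):
    hot = log_rate_factor(1.1, T_STAR, beta)                     # T > T*: grows without bound
    cold = log_rate_factor(0.9, T_STAR, beta)                    # T < T*: tends to minus infinity
    sheet = log_rate_factor(T_STAR + 2.0 / beta, T_STAR, beta)   # T - T* = O(T*/beta): stays O(1)
    print(beta, hot, cold, sheet)
```

Only in the third case does the exponent stay bounded as beta increases, which is why the reaction is confined to regions where T is within O(RT_*^2/E) of T_*.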

Application of activation energy asymptotics to a steady one-dimensional
problem such as the fuel drop problem requires, in general, the construction of solutions
in the three different kinds of regions and matching them in the usual way (that is, in
the sense of matched asymptotic expansions). Usually, the most difficult part of this
procedure is deciding what regions are needed and where they are located. As an
example, if we ask what is the nature of the solution for a point on the middle branch
of the S-shaped response (Fig. 2), it turns out that $T_*$ is the maximum temperature.
That is, at some finite value of r the temperature is equal to $T_*$ so that all the
reaction occurs in a thin flame sheet located there, and on either side of the sheet the
combustion is frozen (Fig. 3).


Fig.  3.  Typical  Temperature  Distribution  for  a 
Solution  on  the  Middle  Branch 
In the frozen regions solutions of the linear governing equations can easily be
constructed. In the flame sheet the description is nonlinear, but because the sheet is thin
the equations are simplified. Matching the flame sheet solution with the solutions in
the frozen regions ultimately leads to the complete solution of the problem and in
particular the determination of the burning rate M. A remark about the nature of the
solution on the other two branches will be made later.

3. PREMIXED FLAMES. The fuel drop problem is an example of what is known
as a diffusion flame. There are other kinds of flames in which the reactants are
supplied as a homogeneous mixture which merely needs to be raised to an adequate
temperature to initiate burning. Such flames are called premixed flames, a common
example being the inner cone of a bunsen burner flame (observed when the air hole
is open which permits oxygen to mix with the gas as it passes up the tube).

If a match is applied to such a mixture, confined within a tube, the mixture
will burn and a flame will travel down the tube consuming the mixture as it goes.
Under ideal conditions this flame travels as a progressive wave with a more or less
well defined wave speed, and one of the classical problems of laminar flame theory is
to determine that wave or flame speed. What I want to do now is show how
this can be done using activation energy asymptotics, since this is one of the simplest
nontrivial applications of activation energy asymptotics presently known.

For a premixed flame the simplest kind of sensible chemical kinetic scheme is

[Mixture] → [Product]
at a rate $\omega = BY \exp(-E/RT)$

where Y is the mass fraction of mixture (a preexponential temperature dependence
like the $T^{\alpha}$ that was included in the fuel drop equations could be inserted without
essentially changing the subsequent discussion).

The  flame  is  assumed  to  be  one-dimensional  and  the  situation  in  a  flame-fixed 
frame  is  shown  in  Fig.  4. 


Fig. 4. The One-Dimensional Premixed Flame. (Hot product, Y = 0, $T = T_f$, on the left; cold mixture, Y = 1, $T = T_\infty$, on the right; stationary flame between them.)


*The work of A. Linan, Astronautica Acta, 1, 1007 (1974) on the counterflow diffusion
flame provides an exhaustive description of calculations of this kind. Kapila, A. K.,
Ludford, G. S. S. & Buckmaster, J. D. Combustion and Flame, 361 (1975)
describe similar calculations for a spherical premixed flame.
The cold mixture comes in from the right and passes through the flame where it
is burnt and emerges as hot product on the left. The reaction only stops when all the
mixture is consumed so that Y = 0 on the left and the temperature there is $T_f$,
the so called adiabatic flame temperature.

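Appropriate model equations are

$$ \rho v \frac{dY}{dx} = \frac{d}{dx}\left( \rho D \frac{dY}{dx} \right) - BY \exp(-E/RT) $$

$$ \rho v\,C_p \frac{dT}{dx} = \frac{d}{dx}\left( \lambda \frac{dT}{dx} \right) + QBY \exp(-E/RT) $$

$$ \rho v = m \ \text{(constant)} $$

$$ \rho T = \text{constant} $$

which are similar, in many respects, to the fuel drop equations written down earlier.
$m$ is the mass flux, the fundamental unknown being essentially the flame
speed, and $\rho T$ = constant is simply a statement that the pressure is constant, valid
for low Mach Number flames.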

The flame temperature $T_f$ can be determined without solving this system
since we know exactly how much of the mixture is consumed (all of it) and we know
exactly how much heat is released per unit of mixture
consumed (Q). Therefore an overall energy balance requires

$$ C_p (T_f - T_\infty) = Q. $$

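The system, when appropriately non-dimensionalized, is

$$ -\frac{dY}{d\xi} = \frac{1}{L}\frac{d^2Y}{d\xi^2} - \frac{B\lambda}{m^2 C_p}\,Y \exp(-\theta/\phi) $$

$$ -\frac{d\phi}{d\xi} = \frac{d^2\phi}{d\xi^2} + \frac{B\lambda}{m^2 C_p}\,Y \exp(-\theta/\phi) $$

($\xi \sim x$, $\theta \sim E$, $\phi \sim T$, $L = \lambda/\rho D C_p$ is the Lewis No.)

$$ \text{as } \xi \to +\infty: \ Y \to 1, \ \phi \to \phi_\infty; \qquad \text{as } \xi \to -\infty: \ Y \to 0, \ \phi \to 1 + \phi_\infty $$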
and the essential idea is that this system only has a solution for a unique choice of
the parameter $B\lambda/m^2 C_p$ and so in this way the flame speed can be determined.


An enormous amount of ingenious effort has been expended over the years on
the solution of this problem, and literally dozens of approximate solutions can be
found in the literature each purporting to be simpler or more accurate than previous
attempts. Most of this work was rendered obsolete in 1970 by Bush and Fendell who
showed how the problem can be solved rationally in the limit of infinite activation
energy ($\theta \to \infty$).

Just as for the fuel drop problem we can not just put $\theta = \infty$ in the equations --
it is necessary to let $B\lambda/m^2 C_p \to \infty$ at the same time. More precisely we write


L  (1+ V 


0^  exp  [  0(1) 


a choice partly motivated by the observation that we would expect, on physical grounds,
that the flame temperature $(1 + \phi_\infty)$ is the maximum temperature and moreover $\phi$
increases monotonically from $\phi_\infty$ to $(1 + \phi_\infty)$ as the flame is traversed. Be that
as it may, the problem is to find $\Omega$.

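The reaction rate $\omega$ is proportional to

$$ \exp\left[ \frac{\theta}{1 + \phi_\infty} - \frac{\theta}{\phi} \right] $$

so that wherever $\phi$ is less than the flame temperature the reaction is frozen and the
governing equations are

$$ \frac{d^2\phi}{d\xi^2} + \frac{d\phi}{d\xi} = 0, \qquad \frac{d^2Y}{d\xi^2} + L\frac{dY}{d\xi} = 0 $$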

In actual fact the system as written doesn't have a solution at all, since the upstream
state Y = 1, $\phi = \phi_\infty$ is not a solution of the equations (the so-called cold boundary
difficulty). The problem arises because the temperature dependence of the reaction
rate is not accurately modelled by $\exp(-\theta/\phi)$ when $\phi$ is small. A realistic
resolution of the difficulty is to introduce a cutoff temperature lying between $\phi_\infty$ and $1 + \phi_\infty$
below which the reaction rate is identically zero. No specific choice for this
temperature is needed when the activation energy $\theta$ is large, as the subsequent analysis
shows.

Bush, W. B. & Fendell, F. E. Combustion Science & Technology, 1, 421 (1970).
with elementary solutions. The location of the origin of coordinates can be chosen
so that these equations are valid in $\xi > 0$.

Noting that

$$ Y \equiv 0, \quad \phi \equiv 1 + \phi_\infty $$

is an exact solution of the complete equations, the large scale structure of the flame
is obtained by piecing together this exact solution and appropriate solutions of the
frozen equations, as shown in Fig. 5.

Fig. 5. Large Scale Structure of the Flame

The frozen solutions in $\xi > 0$ are chosen to ensure that the boundary conditions
as $\xi \to \infty$ are satisfied and that $\phi$ and Y are continuous at the origin.

To complete the solution it is necessary to analyze the thin region near the origin
where the derivatives are smoothed out. The chemical reaction is confined to this
region, which is therefore a flame sheet, and the local solution has the form


(1  +  <(>00)  +  ^(1  +4>oo)^  +  O(^) 

Y  ~  i  y  (ji)  +  o(L) 

^  thickness  of  order  O  ( ^  )  but  gradients  there 
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The  perturbation  quantities  satisfy 


dr  d£ 
It  follows  that 


=  4  [  — ^1  = 

I  L  (l+<t>oo)  J 


n  y  e 


L  (l+'i’oo)' 


y  +  L  ^ 


matr;Si  ^rs^l^tion  sheet)' implies  that  I  is  identically  zero, 

problem  for  ip  alone  toay  then  be  formulated 


linear  function  Z(z).  But  then  matcWng^(both  ^y_  and^  /anish^as 


as  ^ 


0  =  4 

dJl^ 

►  -  CO  ip 


J2  e^ 


as  i 


+00 


dii 

HI 


- 1 


The  latter  boundary  condition  arises  from  matching  ahead  of  the  flame  sheet  (Fig.  5), 


Integrating  once, 

0  =  -  2^i(ipe^  -e’f^+l) 

and  then  applying  the  condition  as  Jl  -+  oo  leads  to  Bush  and  Fendell’s  result 
and  completes  the  determination  of  the  flame  speed. 


4. THE MODIFIED PREMIXED FLAME. There are two features of Bush and Fendell's
solution that I want to emphasize. First of all, one of the reasons that the
analysis is so simple is that the chemistry-free equations can be so easily solved. This
suggests a question. Suppose we were concerned with a more complicated
problem, one related to the one-dimensional premixed flame but whose description
requires additional terms in the governing equations. What additional terms would lead
to chemistry-free equations as easy to solve as Bush and Fendell's? Such a question
obviously does not have a unique answer, but one possibility is

$$\frac{d^2\phi}{d\xi^2} + \frac{d\phi}{d\xi} = \frac{1}{\theta}\, f(\xi, \phi, Y, \ldots)$$

$$\frac{d^2 Y}{d\xi^2} + L\,\frac{dY}{d\xi} = \frac{1}{\theta}\, g(\xi, \phi, Y, \ldots)$$



where  f  and  g  are  quite  arbitrary.  Perturbation  solutions  of  these  equations  can 
easily  be  constructed. 


The  second  thing  to  notice  about  Bush  and  Fendell's  solution  is  that  the  flame 
speed  is  extremely  sensitive  to  the  value  of  the  maximum  temperature.  The  expression 
for  the  flame  speed  (essentially  m)  is  ^ 


BA 


Jt 

m  C 


2L  (1+d)  )■ 


exp  [. 


l+cf> 


00 


and it is clear that small changes in the flame temperature (1 + φ_∞) will generate large
changes in the flame speed. Order O(1/θ) changes in temperature are sufficient to
generate O(1) changes in the speed, for example. The significance of the modified
equations written down above is that we might expect that the O(1/θ) perturbation
terms can generate O(1/θ) changes in the maximum temperature and thus lead to
solutions quite different from Bush and Fendell's. And yet we would not expect the inclusion
of these terms to unduly complicate the analysis.
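
The sensitivity argument can be checked numerically. The sketch below is only an illustration: it isolates a bare Arrhenius factor exp(-θ/T) rather than the full flame speed expression, and the names `arrhenius_ratio`, `t_flame` and `delta` are hypothetical. It shifts the temperature by δ/θ and shows that the factor changes by an O(1) ratio that approaches exp(δ/T²) as θ grows.

```python
import math


def arrhenius_ratio(theta, t_flame, delta):
    """Ratio exp(-theta/(T + delta/theta)) / exp(-theta/T) for T = t_flame."""
    perturbed = t_flame + delta / theta
    # Combine the two exponents before exponentiating to avoid underflow.
    return math.exp(-theta / perturbed + theta / t_flame)


t_flame = 1.0  # hypothetical nondimensional flame temperature (1 + phi_inf)
delta = 1.0    # hypothetical O(1) coefficient of the O(1/theta) temperature shift

for theta in (10.0, 100.0, 1000.0):
    print(theta, arrhenius_ratio(theta, t_flame, delta))

# As theta grows the ratio tends to exp(delta / t_flame**2), an O(1) number,
# even though the temperature shift itself is only O(1/theta).
print(math.exp(delta / t_flame ** 2))
```

The shift in temperature shrinks like 1/θ while the ratio stays of order one, which is the mechanism the text relies on.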

Let  us  consider  a  simple  example. 


5. EFFECT OF HEAT LOSSES. In any real flame there are heat losses due to
radiation or conduction to adjacent boundaries. In a one-dimensional formulation these
losses can be modelled by adding a term -K(T - T_∞),† K = constant, to the energy
equation so that

$$m c_p \frac{dT}{dx} = \frac{d}{dx}\left(\lambda \frac{dT}{dx}\right) - K(T - T_\infty) + QBY \exp\left(-\frac{E}{RT}\right).$$

The  extra  term  tends  to  drive  the  temperature  to  the  reservoir  value,  and  the  modified 
equations  are  of  the  type  discussed  above  provided  the  magnitude  of  K  is  such  that  the 
non-dimensional term is O(1/θ).

†Quite general functions of T can in fact be handled by the analysis; see Buckmaster,
J. D. Combustion & Flame (in press).

The analysis is more complicated than Bush and Fendell's but no new principles
are involved, and defining

H = Flame Speed/Adiabatic Flame Speed

we  find 

(l+cj,^)^  In  H  +  K'  =0 


where  K’  is  K  non-dimensionalized. 

When K' vanishes there are two solutions, H = 1 and H = 0, and when K' is too
large there are no solutions (Fig. 6). This principle has been known for many years


Fig.  6.  Flame  Speed  vs  Heat  Losses 


and is the foundation of the miner's safety lamp invented by Humphry Davy in the
early 19th century. The safety lamp consists of a naked flame surrounded by a wire
gauze cage and if this is carried into a combustible atmosphere, the latter passes
through the gauze and burns on contact with the flame. Without the gauze cage the
flame would spread through the surrounding atmosphere, usually in a violent
(explosive) fashion, but the gauze is such an efficient conductor of heat that the flame
cannot pass through it. Thus the miner, on seeing the flame flare up, can safely retreat.
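
The quenching behaviour can be illustrated with a short numerical sketch. It assumes, as a stand-in for the response equation, the form H² ln H + k = 0, where k is a hypothetical lumped heat-loss parameter (K' divided by whatever constant factor multiplies the logarithmic term); the exponent on H is an assumption, since the printed equation is not fully legible. Under that assumption the curve is C-shaped with a turning point at H = exp(-1/2), and no steady flame exists once k exceeds 1/(2e).

```python
import math


def response(h):
    """Assumed heat-loss response function g(H) = H^2 ln H (an assumption)."""
    return h * h * math.log(h)


def bisect(func, lo, hi, tol=1e-12):
    """Root of func on [lo, hi], assuming func changes sign on the interval."""
    f_lo = func(lo)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = func(mid)
        if (f_mid > 0) == (f_lo > 0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def flame_speeds(k):
    """Return (lower, upper) roots of H^2 ln H + k = 0, or None if quenched."""
    if k < 0:
        raise ValueError("heat loss parameter must be non-negative")
    if k == 0:
        return 0.0, 1.0                # the two adiabatic solutions
    h_turn = math.exp(-0.5)            # turning point of the C-shaped curve
    k_max = 1.0 / (2.0 * math.e)       # largest heat loss that admits a flame
    if k > k_max:
        return None
    f = lambda h: response(h) + k
    lower = bisect(f, 1e-12, h_turn)
    upper = bisect(f, h_turn, 1.0)
    return lower, upper


for k in (0.0, 0.05, 0.15, 0.2):
    print(k, flame_speeds(k))
```

For small k there are two flame speeds, they merge at the turning point, and beyond it the function returns `None`, which is the numerical counterpart of the flame failing to pass through the gauze.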

Looking again at the response diagram (Fig. 6), recall that as we move around
the curve the maximum temperature changes by only an O(1/θ) amount.
In the fuel drop response (Fig. 2), the top branch of the curve, including the extinction
point, corresponds to solutions for which the maximum temperature differs by only an
O(1/θ) amount from the maximum temperature in the equilibrium (D → ∞) limit.
The lower branch, including the ignition point, corresponds to solutions for which the
maximum temperature differs by only an O(1/θ) amount from the maximum
temperature in the frozen (D → 0) limit. Thus there is an analogy between the C-shaped
quenching curve of Fig. 6, and the C-shaped extinction and ignition curves of Fig. 2.


Once the idea of adding O(1/θ) perturbation terms to systems of flame
equations and looking for solutions that differ by an O(1) amount from the unperturbed
solution is understood, there are an infinite number of problems that one can examine.
One is limited only by one's imagination in conjuring up different kinds of
perturbations, and of course any flame can be perturbed, not just the one-dimensional premixed
flame. Let us consider some unsteady examples.


6. UNSTEADY ONE-DIMENSIONAL PREMIXED FLAME. Consider the unsteady
form of Bush and Fendell's problem, for which the equations are:


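$$\rho \frac{\partial Y}{\partial t} + \rho u \frac{\partial Y}{\partial x} = \frac{\partial}{\partial x}\left(\rho D \frac{\partial Y}{\partial x}\right) - BY \exp\left(-\frac{E}{RT}\right)$$

$$\rho c_p \frac{\partial T}{\partial t} + \rho c_p u \frac{\partial T}{\partial x} = \frac{\partial}{\partial x}\left(\lambda \frac{\partial T}{\partial x}\right) + QBY \exp\left(-\frac{E}{RT}\right)$$

$$\frac{\partial \rho}{\partial t} + \frac{\partial}{\partial x}(\rho u) = 0$$

$$\rho T = \text{constant}.$$
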


These differ from the earlier equations only by the addition of the time derivatives.
Now the steady flame has a characteristic thickness,

$$\frac{\lambda}{m c_p}.$$

There is a characteristic velocity, the flame speed,

$$\frac{m}{\rho_\infty},$$

and so we can define a characteristic time

$$\frac{\lambda \rho_\infty}{m^2 c_p}.$$


If we try to solve the unsteady equations, as an initial value problem for example,
then most disturbances will change on this time scale and will be governed by the
complete system of equations, without simplification. Even without chemistry this system
presents a formidable challenge. However, it is conceivable that there are disturbances
that change on the much larger time scale

$$t = O\left(\frac{\theta \lambda \rho_\infty}{m^2 c_p}\right),$$

in which case the time derivatives are O(1/θ) terms and so can be handled in the
same way as the small heat loss term. Indeed we find

$$2(1+\phi_\infty)^2 H^2 \ln H + b\,\frac{dH}{dt} = 0$$

an equation first derived by Sivashinsky.† Here t is time, H is the flame speed
ratio as before, and b is a parameter that depends upon the Lewis Number L.

b  <  0  if  L  >  1 

b  >  0  if  L  <  1 

b  =  0  if  L  =  1  . 

Apparently,  when  L  =  1  there  are  no  disturbances  that  change  on  the  slow  time 
scale,  an  atypical  situation.  It  is  tempting  when  solving  combustion  problems  to  choose 
L  =  1,  since  this  often  leads  to  mathematical  simplification  (the  steady  one-dimensional 
premixed  flame  then  has  uniform  enthalpy,  for  example)  but  this  temptation  is  appar¬ 
ently  something  that  should  be  resisted,  at  least  when  dealing  with  unsteady  problems. 

There  are  two  possible  steady  solutions 

H  =  0,  H  =  1  , 

and the stability of these solutions depends upon the sign of b:

b > 0 (L < 1)   H = 1 stable, H = 0 unstable,

b < 0 (L > 1)   H = 1 unstable, H = 0 stable.

Thus if L > 1, Bush and Fendell's solution for the one-dimensional flame is unstable.

It  should  be  emphasized,  of  course,  that  only  the  predictions  of  instability  are  signifi¬ 
cant.  A  flame  that  is  stable  to  the  kind  of  disturbances  that  we  have  considered  here 
might  well  be  unstable  to  other  kinds  of  disturbances. 
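
The stability statements can be explored with a crude time integration. The sketch below assumes the slow-time equation has the form b dH/dt = -2c H² ln H, with c standing for the constant factor (1 + φ_∞)²; both the exponent on H and the placement of the time derivative are a reading of the printed equation rather than a confirmed form, and `evolve` is a hypothetical helper using plain explicit Euler steps.

```python
import math


def evolve(h0, b, c=1.0, t_end=50.0, dt=1e-3):
    """Integrate b*dH/dt = -2*c*H^2*ln(H) from H(0) = h0 by explicit Euler."""
    h = h0
    for _ in range(int(t_end / dt)):
        h += dt * (-2.0 * c * h * h * math.log(h) / b)
    return h


# b > 0 (L < 1): disturbances on either side of H = 1 decay back to H = 1.
print(evolve(0.5, b=1.0), evolve(1.5, b=1.0))

# b < 0 (L > 1): a flame started just below H = 1 drifts away toward H = 0.
print(evolve(0.9, b=-1.0))
```

Only starting values below H = 1 are used for b < 0, because under this assumed form a start above H = 1 grows without bound in finite time and the simple integrator is not designed for that case.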

7.  UNSTEADY  FLAME  WITH  HEAT  LOSSES.  As  we  saw  earlier,  when  there 
are  heat  losses  the  burning  response  is  multiple  valued.  Thus  it  is  of  interest  to  add 
heat  losses  to  the  unsteady  formulation  in  the  hope  of  gaining  insight  into  the  significance 
of  multivalued  responses.  The  result  is 

2  H^  In  H  +  2HK’  +  b  ^  =  0  . 

Note that in addition to the two steady branches shown earlier in Fig. 6, there is a
third steady solution H = 0 (Fig. 7).

†Sivashinsky, G. I. Int. J. Heat Mass Transfer, 17, 1499 (1974).
Fig. 7. Stability of a Flame with Heat Losses when L < 1


Figure 7 shows stability arrows appropriate when L < 1. These indicate the direction
the solution will be driven in an unsteady situation. Thus when L < 1 the branches
AB and CD are stable, whereas CB is unstable. For L > 1 the arrows must be
reversed.


8. THREE-DIMENSIONAL UNSTEADY FLAMES. The perturbation procedure
is not confined to one-dimensional flames. Three-dimensional disturbances can also
be treated provided their nature is such that the three-dimensional terms are essentially
O(1/θ). The equations are rather more complicated since the velocity field must be
determined and this requires solution of the momentum equation

$$\frac{\partial \mathbf{v}}{\partial t} + (\mathbf{v}\cdot\nabla)\mathbf{v} = -\frac{1}{\rho}\nabla p + \nu\left[\nabla^2 \mathbf{v} + \tfrac{1}{3}\nabla(\nabla\cdot\mathbf{v})\right]$$

in addition to the previous equations.

Permissible disturbances are defined in Fig. 8 (recall that the flame thickness
is $\lambda/(m c_p)$).


Fig.  8.  Allowable  Three-Dimensional  Disturbances 


The time scale is the same long time scale as before.
The result for the flame speed H is†


†This is actually a limiting result only valid when the heat released by the reaction
Q is small compared to the enthalpy of the unburnt mixture. In general a single
equation governing the flame speed cannot be written down when there are
three-dimensional disturbances. Nevertheless, many of the qualitative features of the
general result are the same as those of the limiting result. The details are in
Buckmaster, J. D. Combustion & Flame (to appear).

If we look for perturbations of the one-dimensional steady flame of the form


X  =  T  +  6  e®'^f  (n^?)  ,  6  <<  1 


4  +  4  + 

dr 


=  0 


then 


^  ~  ^  4  (l+'l’oo)^+  4b^  ]  . 

If k vanishes, the quantity in square brackets is either zero or negative so that we
recover the earlier result that the flame is unstable if b < 0 (L > 1). But if k ≠ 0
there is a positive root irrespective of the sign of b, so that the one-dimensional flame
is also unstable if L < 1.


The problem of flame instability is an interesting and a complicated one.
Experiment suggests that sometimes instability destroys a flame, sometimes it merely causes
it to flicker, and sometimes bifurcation occurs.† Most of these observations are
presently unexplained but it is possible that the above results will play a role in throwing
light on some of these phenomena. In general we can expect activation energy
asymptotics to contribute significantly to our understanding of flame instability. For example,
Matkowsky and Sivashinsky†† claim to have explained cellular flames in this way.

I  shall  conclude  by  making  some  additional  remarks  about  the  long  time  scale 
that  plays  such  an  important  role  in  the  unsteady  problems  discussed  above.  The  point 
is  best  illustrated  by  considering  a  specific  problem. 


9. SOLID DEFLAGRATION. The burning of a solid is of fundamental interest
in the theory of solid propellant rocket motors, and Fig. 9 shows a classical
one-dimensional model. The solid is hot, because of the proximity of the flame, and gives
off a combustible mixture which burns within the flame. The flame is propagating to
the left relative to the gases but the gases are moving to the right and in the steady
state the flame is stationary relative to the solid. The burning rate depends upon the
pressure and a classical problem is the determination of the steady state burning rate.

The flame is essentially the same as that analyzed by Bush and Fendell. There
are differences in the problems, of course, owing to the different boundary conditions,
and a solution of the heat conduction equation has to be constructed in the solid (which
is being fed to the right in a flame-fixed frame) but the analysis is straightforward and
the results have some connection with experimental reality.†††


†At this point Fig. D.1 (p. 78), Fig. D.11 (p. 86) and Fig. D.10 (p. 85) from Markstein,
G. H. Non-Steady Flame Propagation, AGARDograph No. 75, Macmillan, New York,
1964, were shown.

††Private communication.

†††See Buckmaster, J. D., Kapila, A. K., & Ludford, G. S. S. Astronautica Acta
(to appear).


Fig.  9.  Burning  Solid 


A more complicated problem is one for which the pressure varies with time.
This also is of interest in the study of solid propellant rocket motors since such motors
are often violently unstable. Now if the pressure varies very slowly with time, it is
apparent that the response will be quasi-steady. That is, the burning rate will be the
steady state value corresponding to the instantaneous value of the pressure. The
question then arises: What is the slowest variation in pressure for which there will be a
significant lag in the burning response and therefore significant transient effects? The
answer is pressures that vary on the long time scale

$$\frac{\theta \lambda \rho_\infty}{m^2 c_p}$$

for these will excite the slowly varying disturbances. Indeed, if the appropriate
analysis is carried out we find

where H is the burning rate, p the pressure, and the C_i are constants. The
analysis is inherently a nonlinear one but this is the result for infinitesimal pressure
variations.

Flames  are  often  subject  to  external  stimuli  that  change  with  time  and  what  this 
example  suggests  is  that  provided  the  steady  state  solution  is  known,  the  unsteady  prob¬ 
lem  can  be  solved  and  nontrivial  transient  effects  obtained  provided  the  stimulus  changes 
on  the  long  time  scale.  This  could  have  application  to  a  variety  of  important  problems. 
A MODEL FOR SHOCK INDUCED STRUCTURAL TRANSFORMATIONS

Paul  Harris 

Concepts  and  Effectiveness  Division 
Nuclear  Development  and  Engineering  Directorate 
Picatinny  Arsenal 
Dover,  New  Jersey  07801 

ABSTRACT.  The  problem  of  strain  propagation  in  a  medium  of  time 
and  strain  (energy)  dependent  elastic  constants  is  considered.  For 
the  elastic  constant  model  considered,  analytic  and  finite  difference 
approximations  appear  to  predict  avalanching  of  the  particle  velocity 
in  a  manner  consistent  with  a  dynamic  strain  induced  exothermic 
structural  transformation.  The  application  to  enhancement  of  laser 
interaction  with  aerospace  materials  is  discussed. 

1.  INTRODUCTION.  Recent  years  have  seen  increasing  military 
interest  in  the  interaction  of  high  power  optical  signals  (lasers) 
with  aerospace  materials.  A  problem  of  particular  interest  has  been 
the generation of a shock in an irradiated material in order to
produce a dynamic mechanical deformation in an adjacent material. The
shock receiving medium could be an ... hardware application might be a
detonator or an explosive switch.

For the above type of problem one would obviously like to choose
the medium for shock generation so as to maximize the generated shock
amplitude. There are essentially two ways in which the shock amplitude
can be maximized for a given optical signal: one can maximize the
strength of the laser-material interaction, or one can attempt to find
a generation medium which can act as an amplifier of shock amplitude
(the shock being produced in approximately the electromagnetic skin
depth of the generation medium). In this paper we will mainly
consider some mathematical aspects of the second approach.

2. MATERIAL SELECTION AND PROPERTIES. Some alloys exhibit
"anomalously" large Gruneisen parameters as they undergo structural
"phase" transformations. Typical alloy examples are TiNi and
KTaO3. The Gruneisen parameter (proportional to the thermal expansion
coefficient) is a measure of the pressure change caused by a change in
thermal energy density under constant volume conditions. Since, in the
absence of vaporization effects, the laser interaction serves to deposit
thermal energy in the skin depth region, an enhanced Gruneisen parameter
is equivalent to an enhanced pressure (shock) amplitude.

TiNi is an appropriate shock generation medium because its metallic
properties, even in the absence of Gruneisen effects, serve to produce
a small skin depth and thus a strong laser-material interaction. The
observed¹ Gruneisen parameter enhancement by a factor of approximately
twenty during the near room temperature (martensitic) structural
transformation promises enhancement of an already strong laser-material
interaction. The practical limitation on the above concept is a 10°C
half maximum width for the spike in the Gruneisen parameter, and that
10°C temperature rise represents a rather small thermal energy density
deposition.

The physics which gives rise to the enhanced Gruneisen parameter
also results in exothermic (or endothermic) effects, and different
elastic constants on each side of the transition. While the observed
exothermicity of approximately 6 Cal/gm is not large, when combined
with the observed (approximate) ten percent change in elastic
constants, one has a material which promises interesting
thermomechanical effects. That interest is further raised by the knowledge³,⁴
that an applied strain can trigger the transformation.

We thus have a scenario in which a propagating strain wave (shock)
can trigger a structural transformation, and thus be amplified in the
process. It is that scenario which we will now model and treat below.

3. STRAIN PROPAGATION IN A TRANSFORMING MEDIUM. While there
exists a number²,⁵ of elegant approaches to the physics of structural
phase transitions, those approaches do not yet appear capable of
treating the propagating strain condition of interest here. We thus proceed
somewhat intuitively.

Consider  a  one-dimensional  strain  problem  (particle  displacement 
only  in  the  direction  of  strain  propagation)  characterized  by 


Po 


(1) 


$$c = c_0 + a\,(c_1 - c_0) \qquad (2)$$

where ρ is mass density, u is particle displacement, c is an elastic
constant, f denotes a viscosity functional, the subscript zero denotes
the undisturbed (prestrain and pretransformation) medium, the subscript
one denotes a final state (transformed) parameter, and a is dependent
upon the degree of transformation.


We model a in the form

$$a = 1 - \exp\left(-\frac{\omega t}{W \tau}\right) \qquad (3)$$

where ω is strain energy, W a constant, and τ is a transformation
incubation time. Thus, to first order in (ωt)


p 


0 


3^U 

w 


a 


» 


a)t  3^U  _  £ 

Wt  3)^ 


(4) 

(5) 


We will set f = 0 even though it is known that attenuation is very
strong in the presence of phase transformations. We will thus have to
keep in mind that any u(x,t) solutions could in practice be of
considerably reduced amplitude.


We  will  now  consider  two  approximations  to  Eq.  (5).  The  first  will 
be  relatively  unphysical,  but  analytically  neat.  The  second  will  in¬ 
volve  the  full  form  of  Eq.  (5),  but  will  involve  a  rough  finite 
difference  approach. 


APPROXIMATION  I:  We  consider 


■  S  (1  +  -p-  =  0’  ^ 

Separating  variables  with  u(x,t)  =  T(t)  X  (x)  gi 


3^0 


ves 


0  1  32T  _ 


(1  +  et)  T  3t- 


m' 


^  32X 

X  ^ 

^  I 


X  =  Xm  exp 


0  +  ^yT  =  O,  y=(l  +  ft). 

3y  p„p 


Eq.  (9)  is  Airy's  equation  and  its  solutions  can  be  written  as® 

'  1/3 


where 


Y  = 


m^ 

Po^' 


(1  +  ft). 


with  Uj  and  U2  being  linearly  independent  and  tabulated® 


(6) 


(7) 

(8) 

(9) 

(10) 
(11) 
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We  now  let  k  =  2-n/\  =  and  evaluate  Pqp  ^ j2 

and  u)  =  W  (i.e.  the  strain  energy  taken  equal  to  its  critical  trans¬ 
formation  value) 


(12) 

where  v  is  the  velocity  of  sound  in  the  preshocked  medium,  \  is  the 
wavelength  of  the  applied  strain  disturbance,  and  c^  =  cJZ  corresponds^ 

to  an  exaggeration  of  the  transition  (exothermic)  from  TiNi  (II)  to 
TiNi  (III).  And  using  ZttV  =  where  is  the  angular  frequency 

of  the  applied  disturbance. 


m- 


P  „T  •  (2^)^ 


(13) 


Thus  Y  becomes 

V-(2v)^^^  (14) 

For  a  particular  we  can  drop  the  subscript  m  in  Eq.  (10)  and  write 


u(o,o)  =  AUj  j  (2a)QT)^'^^,  1 


and  3u(o.t) 


9t 


+  BU, 


t=o 


A  .2/3  ,/ 


_  B_  ,2/3 


(2a)^T)2/^  1 


(2caQT)^/^  1 


1 


(15) 


J 


.  1  I 


If  we  now  make  the  typical  "hydrodynamic"  approximation  of  co^t  «1, 
then  from  Eqs.  (15)  and  (16) 


(2(o^t) 


2/3 


(16) 


A  =  u(o,o), 
B  =-2(2a)QT)'^/^ 


T  ==  -(2a)QT)^^^  u(o,o). 


(17) 

(18) 


t=o 


where  ^ 


Uj(0,l)  =  1,  82(0,1)  =  0 


(19a) 
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(19b) 


Uj'’(0,l)=0,  U2%.1)  =  1 

have  been  used. 

We  can  thus  write 

u(o,t)  &  u(o,o)  Uj  (Y,l)  -  u(o,o)  Ug  (Y,l).  (20) 

We  thus  predict  avalanching  of  the  particle  displacement  at  the 
boundary,  u(o,t),  due  to  the  avalanching  behavior  of  (Y,l).  The 

avalanching is strong as it is occurring even in the presence of a
harmonic  input. 

APPROXIMATION  II.  Here  we  will  consider  a  crude  finite  difference 
version  of  Eq.  (5)  written  with  respect  to  an  almost  constant  velocity 
coordinate  system. 

From  Eq.  (5) 


where  v^^  s  c^/p^  ,  v^^  =  ^  u  =  M  ,  and  W  h  with 

being  a  critical  strain  value. 

Employing  the  so-called  ^  characteristic  stretching  transformation 

5  £  X  -  Vt,  ^  s  aVt,  (22) 

where  a  is  a  dimensionless  stretching  parameter  (we  shall  neglect  terms 
in  a2),  and  defining 

’l'(5,d=||.  (23) 

we  arrive  at 

2aV2f^  +  (v/  -  V2)  +  (Vj2  -  v/)  j  =  0,  (24) 

where  =  aVx. 

Writing  Eq.  (24)  in  crude  finite  difference  form 
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llln* 


^n,) 


where 


V2  .  V  2 


(26a) 


P  ,  ''l^  ■  ''o^  . 

2  - 


(26b) 


Rewriting  Eq.  (25)  with  the  time  derivative  single-stepped  backwards 

9 1 VGS 


»(n,m)  -  *(n.m-l)  =  Gj  -  Gji;  *2(n,m)  I  »(n  tl.m)  + 


Gj  -  G2t„»2(r,.m)  ^ 


where 


:.  S'  (n+1,0)  = 


1+G,  /  . 

-g-^  t(n,o)  - 
bl  G, 


_  We  now  set  T(n,-1)=0  (equivalent  to  turning  the  strain  on  at  t=0, 
and/or  completely  neglecting  the  stretching  parameter).  With  that 
condition  Eq.  (29)  has  a  solution 


f(n,o)  = 


'i'(o,o). 


Eq.  (30)  predicts  a  geometrical  avalanching  (wave)  in  position,  in 
support  of  the  temporal  avalanching  of  Eq.  (20). 

•  known  ®  that  the  martensitic  transformation 

in  Fe-Zy.o/fe  N1  propagates  at  a  velocity  approximately  one  third  v  . 

If  we  thus  choose  V  to  be  that  velocity  of  propagation  of  the  trans¬ 
formation,  then  Gi  is  large  (a  being  small)  and  negative.  Thus  the 
spatial  avalanching,  while  present,  does  not  appear  to  be  as  strong 
as  the  avalanching  in  time.  ^ 
4. DISCUSSION. The two approximations considered above hint
strongly that shock amplification can occur in the presence of a
structural transformation. Considerably more work is needed,
however, before the prediction of an amplification factor is possible.

In closing we will briefly list what we believe to be the promising
approaches for future work.

(a) Modeling. The inclusion of microscopic effects (e.g. soft
phonon and interatomic potential effects) in the modeling
of a.

(b) Attenuation. It is conceivable that known strong attenuation
during the transformation process could severely limit the
predicted amplification. While experimentally determined¹
attenuation factors in TiNi lead us to believe that this is
not the case, f ≠ 0 must be included at least for completeness.

(c) Soliton propagation. The current fad in spatially bounded
nonlinear propagation effects involves soliton⁵,⁹ physics.
It is necessary to seek solutions of Eq. (5) from such a
point of view.

(d) Finite differencing. It is necessary to refine the work
of approximation II.
ABSTRACT. It takes of the order of N³ operations to solve a set
of N linear equations in N unknowns. When the underlying physical
problem has some time- or shift-invariance properties, the coefficient
matrix is of Toeplitz (or difference or convolution) type and the equations
can be solved with O(N²) operations. We have shown that with any
nonsingular N × N matrix, we can associate an integer α between 1 and
N such that it takes O(N²α) operations to invert the matrix. The number
α may be small for many non-Toeplitz matrices of physical interest. Some
aspects of this result are discussed here, including extensions to
continuous-time kernels and integral equations.

1. INTRODUCTION. Problems in many fields lead ultimately to the
solution of linear matrix equations

Ra = m ,

where R is a given N × N matrix, say, and m is a given N × 1 vector.
The number of operations required to solve such an equation, or to find
R⁻¹, is of the order of N³ (multiplications and additions). This can be
prohibitive if N is large (500 or 1000 or 3000, as can arise in many
power system or econometric calculations). For this, and other reasons,
we must often try to bring in any special features or structures that may
be present in the original physical problem. In many applications


†This report is a summary of a talk given at the 22nd Conference of Army
Mathematicians, Watervliet Arsenal, New York, May 1976. It was based on
work done jointly with B. Friedlander, L. Ljung and M. Morf (see the
references).

This work was supported by the Air Force Office of Scientific Research,
Air Force Systems Command under Contract AF44-620-74-C-0068, and in part
by the Joint Services Electronics Program under Contract N00014-75-C-0601,
and the National Science Foundation under Contract NSF-Eng 75-18952.
we have the property


That is, the phenomena are invariant to a change in the time- or
space-origin (e.g., as with stationary random processes, or homogeneous media,
etc.). In this case, the matrix R is said to be a Toeplitz matrix and
has the nice feature that its inverse can be found with only O(N²)
multiplications. Moreover the inverse can be computed recursively, i.e., the
N × N inverse can be easily updated to yield the (N + 1) × (N + 1)
inverse, and Toeplitz matrices also have other useful properties.
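
As an illustration of how the Toeplitz structure brings the cost down to O(N²), the following sketch implements the classical Levinson recursion for a symmetric Toeplitz system. This is a standard textbook method chosen for illustration, not necessarily the procedure used by the authors; the function name `levinson_solve` is hypothetical, and the code assumes all leading principal submatrices are nonsingular (for example, a positive definite matrix). Each pass extends the solution of the n × n leading system to the (n + 1) × (n + 1) one, which mirrors the recursive updating mentioned above.

```python
def levinson_solve(r, y):
    """Solve T x = y where T[i][j] = r[abs(i - j)] (symmetric Toeplitz).

    Assumes every leading principal submatrix of T is nonsingular.
    Uses O(N^2) multiplications instead of the O(N^3) of plain elimination.
    """
    n_total = len(y)
    a = [1.0]                  # a solves T_n a = (err_power, 0, ..., 0)
    err_power = float(r[0])
    x = [y[0] / r[0]]          # solution of the 1 x 1 leading system
    for n in range(1, n_total):
        # Extend the auxiliary vector a from order n to order n + 1.
        delta = sum(a[i] * r[n - i] for i in range(n))
        k = -delta / err_power
        a_ext = a + [0.0]
        a = [a_ext[i] + k * a_ext[n - i] for i in range(n + 1)]
        err_power *= 1.0 - k * k
        # Correct the zero-padded solution so that the new last row matches.
        mismatch = y[n] - sum(x[i] * r[n - i] for i in range(n))
        gain = mismatch / err_power
        x = [xi + gain * a[n - i] for i, xi in enumerate(x + [0.0])]
    return x


# Example: first column (4, 1, 0.5) defines a 3 x 3 symmetric Toeplitz matrix.
r = [4.0, 1.0, 0.5]
y = [7.5, 12.0, 14.5]          # right-hand side built from x = (1, 2, 3)
print(levinson_solve(r, y))
```

Only the first column `r` is stored, so the memory cost is O(N) as well; the reversed copy of `a` plays the role of the backward solution because the matrix is symmetric.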

Unfortunately, most operations on Toeplitz matrices destroy the Toeplitz
property. For example, the inverse of a Toeplitz matrix is not Toeplitz,
unless the matrix is also lower- or upper-triangular. So also the product
of two Toeplitz matrices is not Toeplitz, unless the matrices are also
both lower-triangular or both upper-triangular. However, some reflection
will show that in various ways one can regard certain matrices as being
less non-Toeplitz than others, though present solution methods cannot
take advantage of this: they require O(N²) operations in the Toeplitz
case, and O(N³) otherwise.

By a long process of abstraction and simplification of results originally
obtained for certain nonlinear differential equations [1], [2], we have
been able to show essentially the following (more precise results are
stated later): with any invertible N × N matrix R we can associate
an integer α, 1 ≤ α ≤ N, such that it takes O(N²α) operations to
compute its inverse. The integer α may be called the displacement rank
(or index of nonstationarity) of the matrix and has the property that it
is low for matrices that are Toeplitz or near to Toeplitz, while it is
high for arbitrary matrices. For example,


i) $\alpha = 1$ for R = L or U or LU or UL, where L and U
denote lower- and upper-triangular Toeplitz matrices,
ii) $\alpha = 2$ for $R = (L + U)$ and $R = (L + U)^{-1}$,

iii) $\alpha \le 4$ for $R = (L_1 + U_1)(L_2 + U_2)$,

iv)  a  <  3  for  R  =  [Lj^  +  ;  Lg  +  Ugl 

v)  a  <  n,  if  R  is  the  covariance  matrix  of  a  linear  combination 

of  the  components  of  any  n-vector  wide-sense  Markov  random 
process. 

In such cases, $O(N^2\alpha)$ can often be significantly less than $O(N^3)$,
thus yielding many advantages, not just for solving a given large set of
equations, but also for interactive adjustment of the mathematical model
(i.e., of R and m) based on actual examination of the now-more-easily
determined solution a.

We shall outline our major results in Section 2, for matrix equations.
A similar, and somewhat simpler, development can be carried out for
integral equations, as noted in Section 3. Section 4 contains some
concluding remarks on possible extensions and generalizations.

For the matrix case, more details can be found in the paper [3],
but we note the key definitions and results here.

Definition 1. The (+)-displacement rank of an $N \times N$ matrix R is the
smallest integer $\alpha_+(R)$ such that we can write

$$R = \sum_{i=1}^{\alpha_+(R)} L_i U_i$$

for some lower-triangular Toeplitz matrices $\{L_i\}$ and some upper-triangular
Toeplitz matrices $\{U_i\}$.

Definition 2. The (-)-displacement rank of an $N \times N$ matrix R is the
smallest integer $\alpha_-(R)$ such that we can write

$$R = \sum_{i=1}^{\alpha_-(R)} U_i L_i$$

for some lower-triangular Toeplitz matrices $\{L_i\}$ and upper-triangular
Toeplitz matrices $\{U_i\}$.

Definition 3. Let

$$Z = \text{the lower-shift matrix} = \begin{bmatrix} 0 & & & \\ 1 & 0 & & \\ & \ddots & \ddots & \\ & & 1 & 0 \end{bmatrix} .$$

Lemma 1. Computation of Displacement Ranks.

$$\alpha_+(R) = \rho\{J(R)\} ,$$

where

$$J(R) = R - ZRZ'$$

and $\rho\{A\}$ = the rank of the matrix A. Also

$$\alpha_-(R) = \rho\{\Gamma(R)\} ,$$

where

$$\Gamma(R) = R - Z'RZ . \tag{2}$$
The proof follows by using the result of Lemma 2.

Lemma 2. Given two column vectors x, y there is one and only one solution
of the functional equation

$$J(R) = xy' \tag{3a}$$

and this is

$$R = L(x)U(y') , \tag{3b}$$

where ' denotes transpose, L(x) is a lower-triangular Toeplitz matrix
whose first column is x, and U(y') is an upper-triangular Toeplitz
matrix with first row y'.

Proof. For uniqueness, note that

$$J(R_1) = J(R_2)$$

implies

$$R_1 - ZR_1Z' = R_2 - ZR_2Z'$$

or

$$R_1 - R_2 = Z(R_1 - R_2)Z' ,$$

whose only solution is clearly zero (iterate the relation N times and
use $Z^N = 0$).

The rest amounts to verifying that $J\{L(x)U(y')\} = xy'$, which the
reader may find amusing to check by direct computation for $3 \times 3$ matrices.
Lemma 1 now follows easily from the observation that

$$R = \sum_i L(x_i)U(y_i') \iff J(R) = \sum_i x_i y_i' .$$
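The direct computation suggested in the proof can also be done numerically. The following is a minimal sketch, assuming NumPy; the helper names (`lower_shift`, `lower_toeplitz`, `upper_toeplitz`, `displacement`, `alpha_plus`) and the rank tolerance are illustrative choices, not notation from the paper. It checks Lemma 2 for a pair of 3-vectors and evaluates the (+)-displacement rank through Lemma 1.

```python
import numpy as np

def lower_shift(n):
    """Lower-shift matrix Z: ones on the first subdiagonal, zeros elsewhere."""
    return np.eye(n, k=-1)

def lower_toeplitz(x):
    """L(x): lower-triangular Toeplitz matrix whose first column is x."""
    x = np.asarray(x, dtype=float)
    n = len(x)
    return sum(x[k] * np.eye(n, k=-k) for k in range(n))

def upper_toeplitz(y):
    """U(y'): upper-triangular Toeplitz matrix whose first row is y'."""
    return lower_toeplitz(y).T

def displacement(R):
    """J(R) = R - Z R Z'."""
    Z = lower_shift(R.shape[0])
    return R - Z @ R @ Z.T

def alpha_plus(R, tol=1e-9):
    """(+)-displacement rank via Lemma 1: the rank of J(R)."""
    return int(np.linalg.matrix_rank(displacement(R), tol=tol))

# Lemma 2 for 3 x 3 matrices: J{L(x) U(y')} = x y'
x = np.array([1.0, 2.0, 3.0])
y = np.array([4.0, 5.0, 6.0])
R = lower_toeplitz(x) @ upper_toeplitz(y)
print(np.allclose(displacement(R), np.outer(x, y)))

# A symmetric Toeplitz matrix built from its first column c
c = np.array([4.0, 1.0, 0.5, 0.25, 0.1, 0.05])
T = lower_toeplitz(c) + upper_toeplitz(c) - c[0] * np.eye(len(c))
print(alpha_plus(R), alpha_plus(T))
```

The tolerance passed to `matrix_rank` is an assumption that suits well-scaled examples; badly scaled matrices would need a relative threshold.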

We can now state a first simple, but apparently new, result.

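Theorem 1.

$$\alpha_-(R^{-1}) = \alpha_+(R) . \tag{5}$$

Therefore,

$$R = \sum_{i=1}^{\alpha_+(R)} L_i U_i \tag{6a}$$

implies that $R^{-1}$ has the form

$$R^{-1} = \sum_{i=1}^{\alpha_+(R)} \mathcal{U}_i \mathcal{L}_i . \tag{6b}$$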

Proof. We give the simple proof (suggested by S.-Y. Kung) because it
shows that the result is quite general and depends very little on the
nature of the entries of R; for example, they could themselves be matrices.

We note that

$$\alpha_-(R^{-1}) = \rho\{R^{-1} - Z'R^{-1}Z\}$$
$$= \rho\{(R^{-1} - Z'R^{-1}Z)R\}$$
$$= \rho\{I - Z'R^{-1}ZR\}$$

since rank is unaffected by multiplication by a nonsingular matrix. Now
by a well-known matrix result that

$$\rho\{I - AB\} = \rho\{I - BA\}$$

we can continue the above chain as

$$\alpha_-(R^{-1}) = \rho\{I - ZRZ'R^{-1}\}$$
$$= \rho\{(I - ZRZ'R^{-1})R\}$$
$$= \rho\{R - ZRZ'\}$$
$$= \alpha_+(R) . \ \blacksquare$$
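Theorem 1 is easy to probe numerically. The sketch below is an illustration under stated assumptions: it uses NumPy, the function names are hypothetical, and the test matrix (the Toeplitz matrix with entries $0.5^{|i-j|}$) is chosen only because it is well conditioned. It compares the rank of $R - ZRZ'$ with the rank of $R^{-1} - Z'R^{-1}Z$.

```python
import numpy as np

def lower_shift(n):
    """Lower-shift matrix Z: ones on the first subdiagonal."""
    return np.eye(n, k=-1)

def alpha_plus(R, tol=1e-9):
    """Rank of J(R) = R - Z R Z'."""
    Z = lower_shift(R.shape[0])
    return int(np.linalg.matrix_rank(R - Z @ R @ Z.T, tol=tol))

def alpha_minus(R, tol=1e-9):
    """Rank of Gamma(R) = R - Z' R Z."""
    Z = lower_shift(R.shape[0])
    return int(np.linalg.matrix_rank(R - Z.T @ R @ Z, tol=tol))

# Symmetric Toeplitz example: T[i, j] = 0.5 ** |i - j|
n = 6
idx = np.arange(n)
T = 0.5 ** np.abs(idx[:, None] - idx[None, :])
T_inv = np.linalg.inv(T)

# The inverse is not Toeplitz, yet its (-)-displacement rank matches
# the (+)-displacement rank of T.
print(alpha_plus(T), alpha_minus(T_inv))
```

The inverse here is tridiagonal with unequal diagonal entries, so it is visibly not Toeplitz even though its displacement rank stays at 2.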

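Example. If T is a symmetric Toeplitz matrix, then $\alpha_+(T) = 2 = \alpha_-(T)$,
since we have the representations

$$T = T_+ \cdot I + I \cdot T_+' = I \cdot T_+ + T_+' \cdot I ,$$

where $T_+$ = the lower-triangular part of the matrix T.

The fact that

$$\alpha_+(T) = 2 = \alpha_-(T)$$

can also be seen by checking that

$$J(T) = T - ZTZ' = \begin{bmatrix} t_0 & t_1 & \cdots & t_N \\ t_1 & & & \\ \vdots & & 0 & \\ t_N & & & \end{bmatrix}$$

has rank 2 for all $N > 2$, and

$$\Gamma(T) = T - Z'TZ = \begin{bmatrix} & & & t_N \\ & 0 & & t_{N-1} \\ & & & \vdots \\ t_N & t_{N-1} & \cdots & t_0 \end{bmatrix}$$

has rank 2 for all $N > 2$.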

Now it turns out to have been well-known in many contexts (see the
discussion in [4]) that there exist two lower-triangular Toeplitz
matrices A and B such that

$$T^{-1} = A'A - B'B \tag{7}$$

so that

$$\alpha_-(T^{-1}) = 2 = \alpha_+(T) . \ \blacksquare$$

Remark. Notice that the displacement ranks seem to identify a better
property of matrices than their being Toeplitz. The class of Toeplitz
matrices is not closed under inversion, unlike the (±)-displacement ranks
and the corresponding representations (6).

Theorem 2. The inverse of an $N \times N$ matrix R can be found with $O(N^2\alpha)$
multiplications, where $\alpha$ is an integer such that $\alpha_+ \le \alpha \le \alpha_+ + 2$. This
can be done via certain recursive formulas called the generalized Szegö-
Levinson recursions.

The recursions are a bit too complicated to describe here, but we
may note that for Toeplitz matrices they are equivalent to the well-known
recursions for the Szegö polynomials orthogonal on the unit circle (see
[5, Ch. 11] or [6]). These were rediscovered in the statistics
literature by Levinson [7] and by Durbin [8] for recursively solving
the so-called Yule-Walker normal equations [9].
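For readers who have not met the classical Toeplitz recursion, the following is a minimal sketch of the Levinson-Durbin algorithm for the Yule-Walker equations, assuming NumPy. It is the standard textbook recursion for the symmetric Toeplitz case only, not the generalized recursions of Theorem 2; the function name, argument layout and sign convention (solving $T y = (r_1, \ldots, r_n)'$ with $T_{ij} = r_{|i-j|}$) are choices made here for illustration.

```python
import numpy as np

def levinson_durbin(r):
    """Solve T y = r[1:] where T[i, j] = r[|i - j|], using O(n^2) operations.

    r holds the autocorrelations r[0], ..., r[n]. Returns the solution y
    and the final prediction-error value.
    """
    r = np.asarray(r, dtype=float)
    n = len(r) - 1
    y = np.zeros(n)
    err = r[0]
    for m in range(1, n + 1):
        # Reflection coefficient from the order-(m-1) solution
        k = (r[m] - np.dot(y[:m - 1], r[m - 1:0:-1])) / err
        prev = y[:m - 1].copy()
        # Order update: y_j <- y_j - k * y_{m-j}, then append k
        y[:m - 1] = prev - k * prev[::-1]
        y[m - 1] = k
        err *= 1.0 - k * k
    return y, err

r = np.array([2.0, 1.0, 0.3, -0.2])
y, err = levinson_durbin(r)

# Cross-check against a dense O(n^3) solve
n = len(r) - 1
idx = np.arange(n)
T = r[np.abs(idx[:, None] - idx[None, :])]
print(np.allclose(T @ y, r[1:]))
```

Each pass through the loop costs O(m) operations, which is where the $O(N^2)$ total for the Toeplitz case comes from. The sketch assumes all leading principal minors of T are nonzero, as they are for a positive-definite covariance sequence.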

For other results in the matrix case, we refer to [3], [10]-[11], and
instead turn briefly here to an examination of the integral operator case.

3. INTEGRAL EQUATIONS.

The Fredholm integral equation of the second kind

$$a(t) + \int_0^T K(t,s)a(s)\, ds = m(t) , \quad 0 \le t \le T \tag{8}$$

has  been  extensively  studied,  see,  e.g.,  the  recent  monograph  [12].  Except 
for  the  handful  of  cases  where  explicit  analytic  solution  is  possible,  the 
generic  technique  is  to  replace  the  integral  equation  by  some  approximating 
set  of  N  linear  equations 

Ra  =  m  . 

This can be done in various ways: use of degenerate kernels, projection
(Galerkin and collocation) methods, etc. For example in the degenerate
kernel method we replace K(t,s) by the function

$$K_N(t,s) = \sum_{i=1}^{N} \phi_i(t)\,\psi_i(s) \tag{9}$$

for some suitably chosen functions $\{\phi_i(\cdot), \psi_i(\cdot)\}$. In any case, the
resulting set of linear equations will in general require $O(N^3)$ operations
for their solution and this may be prohibitively large. More significant
however is the observation that such approximation methods will generally
destroy any nice structure that might have been present in the original
problem.

For example, if the kernel was of Toeplitz (also called displacement
or convolution) type,

$$K(t,s) = K(t - s) , \text{ say},$$

then in general

$$K_N(t,s) \ne \text{Toeplitz for } N < \infty .$$

This is bad, because the Toeplitz property can be exploited to find a
nice solution of the integral equation. Briefly, first define

$$H_T(t,s) = \text{the Fredholm resolvent of } K(t,s)$$

as the solution of the integral equation

$$H_T(t,s) + \int_0^T H_T(t,r)K(r,s)\, dr = K(t,s) , \quad 0 \le t,s \le T .$$

In operator notation, we can write this as

$$H + HK = K \quad \text{or} \quad (I - H)(I + K) = I ,$$

which shows that the original equation

$$(I + K)a = m$$

can be resolved as

$$a = (I + K)^{-1} m = (I - H)m ,$$

that is,

$$a(t) = m(t) - \int_0^T H_T(t,s)m(s)\, ds .$$


Therefore the basic problem is to find $H_T(t,s)$. Now even though K(t - s)
may be Toeplitz, this will not in general be true of its resolvent $H_T(t,s)$
(for $T < \infty$). Nevertheless $H_T(t,s)$ is not a completely arbitrary kernel,
but should in some sense be close to a Toeplitz kernel (after all, its
resolvent is Toeplitz).
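The operator identities above carry over directly to a discretized problem. The following is a minimal sketch, assuming NumPy and a midpoint-rule discretization; the kernel $e^{-|t-s|}$, the right-hand side, the grid size and the function names are illustrative assumptions, not taken from the paper. It builds the discrete resolvent from $H + HK = K$ and confirms that $a = (I - H)m$ solves $(I + K)a = m$.

```python
import numpy as np

def discretize_kernel(kernel, T, n):
    """Midpoint rule on [0, T]: K[i, j] = kernel(t_i, t_j) * h."""
    h = T / n
    t = (np.arange(n) + 0.5) * h
    return kernel(t[:, None], t[None, :]) * h, t

def resolvent(K):
    """Discrete resolvent H solving H + H K = K, i.e. H (I + K) = K."""
    n = K.shape[0]
    # H (I + K) = K  is equivalent to  (I + K)' H' = K'
    return np.linalg.solve((np.eye(n) + K).T, K.T).T

n = 50
K, t = discretize_kernel(lambda t, s: np.exp(-np.abs(t - s)), T=1.0, n=n)
H = resolvent(K)

m = np.cos(t)
a = m - H @ m  # a = (I - H) m

I = np.eye(n)
print(np.allclose((I + K) @ a, m))
print(np.allclose((I - H) @ (I + K), I))
```

This dense computation costs $O(N^3)$ operations and makes no use of the Toeplitz structure of K; it only illustrates the relation between K, its resolvent and the solution a.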

We can quantify this intuitive feeling in the following way (the
analog of the method used in Section 2). Define the operator

$$JK(t,s) = \left(\frac{\partial}{\partial t} + \frac{\partial}{\partial s}\right) K(t,s) ,$$

and note that

$$JK(t - s) = 0 .$$

If K(t,s) is not Toeplitz, $JK(t,s) \ne 0$, but it will be some function
of two variables, which we can write as

$$JK(t,s) = \sum_{i=1}^{\alpha} \phi_i(t)\,\psi_i(s) \tag{11}$$

for some functions $\{\phi_i(\cdot), \psi_i(\cdot)\}$ and some integer $\alpha$, possibly even
infinite. However let us define the displacement rank of K(t,s) as
the smallest integer $\alpha(K)$ such that the representation (11) is possible.

Examples. i) K is Toeplitz, $\alpha(K) = 0$.

ii) K(t,s) = min(t,s), the covariance of the simplest
nonstationary random process, the Wiener process.
Clearly JK(t,s) = 1 and $\alpha = 1$.

iii) K(t,s) = ts - min(t,s), the covariance of the
so-called Brownian bridge process. Now
JK(t,s) = s + t - 1 and $\alpha = 2$. ■
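The three examples can be checked numerically by sampling JK on a grid and computing the rank of the sampled matrix. The sketch below assumes NumPy; it uses the fact that $(\partial/\partial t + \partial/\partial s)K(t,s)$ is the derivative of $K(t+\epsilon, s+\epsilon)$ with respect to $\epsilon$ at $\epsilon = 0$, approximated here by a central difference. The grid, step size, tolerance and function names are assumptions made for illustration.

```python
import numpy as np

def displacement_samples(kernel, grid, h=1e-3):
    """Sample JK(t, s) on grid x grid via a central difference along (1, 1)."""
    t = grid[:, None]
    s = grid[None, :]
    return (kernel(t + h, s + h) - kernel(t - h, s - h)) / (2.0 * h)

def kernel_displacement_rank(kernel, grid, tol=1e-8):
    """Numerical estimate of the displacement rank alpha(K)."""
    samples = displacement_samples(kernel, grid)
    return int(np.linalg.matrix_rank(samples, tol=tol))

grid = np.linspace(0.1, 0.9, 9)

toeplitz_kernel = lambda t, s: np.exp(-np.abs(t - s))  # K(t - s)
wiener_kernel = lambda t, s: np.minimum(t, s)          # min(t, s)
bridge_kernel = lambda t, s: t * s - np.minimum(t, s)  # ts - min(t, s)

ranks = [kernel_displacement_rank(k, grid)
         for k in (toeplitz_kernel, wiener_kernel, bridge_kernel)]
print(ranks)
```

For these three kernels the central difference is exact up to rounding, because each kernel is at most quadratic along the direction (1, 1); a general kernel would need a smaller step or an analytic derivative.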

We can show the following result, analogous to Theorem 1 in the matrix
case.

Theorem 3. $\alpha(H_T(t,s)) \le \alpha(K(t,s)) + 2$.

Example. When K is Toeplitz, $\alpha(K) = 0$. However even though its
resolvent $H_T(t,s)$ is not Toeplitz, there exist two functions $A_T(\cdot)$,
$B_T(\cdot)$ such that

$$JH_T(t,s) = A_T(t)A_T(s) - B_T(t)B_T(s) \tag{12}$$

so that

$$\alpha(H_T(t,s)) = 2 .$$

Moreover the functions $A_T(\cdot)$, $B_T(\cdot)$, being functions of one variable, can be
determined more easily than functions of two variables. In fact they
can be obtained via the differential equations

( 


I  -  -  -  ®T<^>\«>  ■  ^  ‘  " 


4B^(t)  =  -  A^(t)B^(t)  ,  0  <  t  <T 

with certain easily determined boundary conditions $A_T(0)$ and $B_T(T)$;
these two equations are numbered (13a) and (13b). The representation of
$JH_T(t,s)$ above is the analog of (7) in the matrix case.

The point is that these differential equations can be solved by a simple
recursive procedure, which needs only a number of operations proportional
to $N^2$, where N is the number of points in [0,T] used in any discretization
procedure.

We call (13) Krein-Szegö-Levinson equations because they are exactly
the recursions found by Krein [13] for the continuous analogs of the
Szegö polynomials on the unit circle.

Theorem 4. If K(t,s) has displacement rank $\alpha$, $H_T(t,s)$ can be found
with $\alpha$ times as much computation as in the Toeplitz case. The solution
is found recursively via a set of generalized Krein-Szegö-Levinson
equations.

Proofs  and  further  results  can  be  found  in  the  papers  [14]-[15]. 

However, we might draw explicit attention to the fact that though we
are using a degenerate-kernel representation in (11), this is for JK(t,s)
and not for K(t,s). Even though JK(t,s) is degenerate it can be seen
by integration that, in operator notation,

$$K = \sum_{i=1}^{\alpha} L_i U_i$$

where the $\{L_i\}$ and $\{U_i\}$ are lower- and upper-Volterra operators.
Therefore K can be very far from a degenerate kernel. The feature
of our method is that it preserves any "Toeplitz-like" structure that may
be present in K(t,s). This thought is pursued a bit further in Section 4.

4. CONCLUDING REMARKS. We have taken Toeplitz kernels as basic
because they, or things close to them, arise in many applications of interest
to us. However in other problems, other "nice" kernels may be more basic.

For example, we might have Hankel kernels

$$K(t,s) = K(t + s) , \text{ say}.$$

Integral equations with such kernels can be solved efficiently, and
therefore it may be of interest to classify kernels in terms of their degree
of "non-Hankelness". This can clearly be done as above by using the
operator

$$\frac{\partial}{\partial t} - \frac{\partial}{\partial s} ,$$

which gives zero when applied to Hankel kernels. Similar results can
also be obtained for basic kernels of the form $K_1(t - s) + K_2(t + s)$.

Furthermore we could also define "second" and higher-order operators
of the type

$$J^2\{K(t,s)\} = \left(\frac{\partial}{\partial t} + \frac{\partial}{\partial s}\right)^2 K(t,s)$$

and so on. It is easy to find examples where these are particularly
appropriate.

As  a  final  comment,  we  should  express  our  feeling  that  the  basic 
ideas  described  above  should  be  adaptable  to  a  variety  of  different 
situations.  Also  there  is  clearly  some  quite  general  algebraic  structure 
lurking  behind  our  results,  which  some  of  the  people  in  this  audience 
may be better equipped to identify than we can.


EXACT SOLUTION TO AN ELASTIC-PLASTIC DEFORMATION
PROBLEM IN A RADIALLY STRESSED ANNULAR PLATE


Peter  C.  T.  Chen 
Benet  Weapons  Laboratory 
Watervliet  Arsenal 
Watervliet,  NY  12189 


ABSTRACT. An exact solution to the small strain contained
plastic deformation problem in an annular plate under internal
pressure is obtained on the basis of the deformation theory of Hencky,
the Mises yield criterion and a modified Ramberg-Osgood law.
Expressions for the stresses, strains and displacement are given.
Some numerical results have been worked out and assessed by using
Budiansky's criterion for the acceptability of the deformation theory.


1. INTRODUCTION. The problem is a partly plastic, annular
plate radially stressed by uniform pressure. The material is assumed
to be elastic-plastic and obeying the Mises yield condition. For
ideally plastic materials, the stress solution for this problem was
first obtained by Mises [1] and the corresponding two strain solutions
were recently obtained by the present author on the basis of $J_2$
deformation and flow theories [2]. The numerical results obtained by
using these two theories indicate that the strain differences are
very small and compressibility of the material should be considered.
However, there is no published solution for strain-hardening materials;
providing one is the purpose of the present investigation.


In  the  present  paper,  an  exact  elastic-plastic  solution  for 
strain-hardening materials is given on the basis of $J_2$ deformation
theory  together  with  a  modified  Ramberg-Osgood  law  [3].  Exact 
solutions  based  on  this  particular  model  were  given  recently  to  an 
infinite  sheet  having  a  circular  hole  under  uniform  external  tension 
[3]  and  internal  pressure  [4].  This  paper  considers  annular  plates 
of  arbitrary  inner  and  outer  radii.  Some  numerical  results  are 
presented  and  the  limitations  of  the  solution  are  discussed. 

2. BASIC EQUATIONS. Assuming small strains and neglecting
inertia forces in the axisymmetric state of plane stress, the radial
and tangential stresses, $\sigma_r$ and $\sigma_\theta$, must satisfy the equilibrium
equation,

$$\sigma_\theta = (\partial/\partial r)(r\sigma_r) ; \tag{1}$$

and the corresponding strains, $\varepsilon_r$ and $\varepsilon_\theta$, are given in terms of the
radial displacement, u, by

$$\varepsilon_r = \partial u/\partial r , \quad \varepsilon_\theta = u/r . \tag{2}$$

We shall assume that the material is elastic-plastic, isotropic,
obeying the simple deformation theory and the strains are related to
the stresses by

$$\varepsilon_r = E^{-1}(\sigma_r - \nu\sigma_\theta) + (E_s^{-1} - E^{-1})(\sigma_r - \tfrac{1}{2}\sigma_\theta) , \tag{3}$$

$$\varepsilon_\theta = E^{-1}(\sigma_\theta - \nu\sigma_r) + (E_s^{-1} - E^{-1})(\sigma_\theta - \tfrac{1}{2}\sigma_r) , \tag{4}$$

where E, $\nu$ are elastic moduli and $E_s$ is the secant modulus on the
effective stress-strain curve with $E_s = \sigma/\varepsilon$ and

$$\sigma = (\sigma_r^2 - \sigma_r\sigma_\theta + \sigma_\theta^2)^{1/2} . \tag{5}$$

A modified uniaxial relation of the Ramberg-Osgood type is assumed,

$$E_s^{-1} = E^{-1} \ \text{for} \ \sigma \le \sigma_y ; \quad E_s^{-1} = E^{-1}(\sigma/\sigma_y)^{n-1} \ \text{for} \ \sigma \ge \sigma_y , \tag{6}$$

and the initial yield surface is defined by the ellipse $\sigma = \sigma_y$.

When compressibility of the material is taken into account,
the longitudinal strain $\varepsilon_z$ can be determined by

$$\varepsilon_r + \varepsilon_\theta + \varepsilon_z = E^{-1}(1 - 2\nu)(\sigma_r + \sigma_\theta) , \tag{7}$$

which holds in the elastic as well as plastic region.

The boundary conditions on the problem are

$$\sigma_r(a, t) = -P , \quad \sigma_r(b, t) = 0 , \tag{8}$$

where a, b and P are the inner and outer radii and the internal pressure,
respectively. The stresses, strains, and displacement
must be continuous throughout the entire region.

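In the following, the solutions will be presented in terms of
nondimensional quantities defined by

$$\alpha = a/b , \quad \xi = r/b , \quad \beta = \rho/b , \quad p = P/\sigma_y ,$$
$$S_r = \sigma_r/\sigma_y , \quad S_\theta = \sigma_\theta/\sigma_y , \quad S = \sigma/\sigma_y ,$$
$$e_r = E\varepsilon_r/\sigma_y , \quad e_\theta = E\varepsilon_\theta/\sigma_y , \quad e_z = E\varepsilon_z/\sigma_y , \tag{9}$$

where $r = \rho$ locates the elastic-plastic boundary.

3. ELASTIC REGION. For small pressure ($p \le p^*$), the plate
will be elastic throughout ($\alpha \le \xi \le 1$) and the solution is
(upper sign for the radial component, lower sign for the tangential one)

$$S_r, S_\theta = p(\alpha^{-2} - 1)^{-1}(1 \mp \xi^{-2}) ,$$
$$e_r, e_\theta = p(\alpha^{-2} - 1)^{-1}[(1 - \nu) \mp (1 + \nu)\xi^{-2}] ,$$
$$e_z = -2p(\alpha^{-2} - 1)^{-1}\nu . \tag{10}$$

The critical value $p^*$ to cause incipient plastic deformation is

$$p^* = (1 - \alpha^2)[3 + \alpha^4]^{-1/2} . \tag{11}$$

For values of p larger than $p^*$, the plate becomes plastic in the
inner region ($\alpha \le \xi \le \beta$) and is still elastic in the outer region
($\beta \le \xi \le 1$). In the outer elastic region, the equations for the
dimensionless stresses and strains are

$$S_r, S_\theta = (1 \mp \xi^{-2})/(1 + 3\beta^{-4})^{1/2} ,$$
$$e_r, e_\theta = [(1 - \nu) \mp (1 + \nu)\xi^{-2}]/(1 + 3\beta^{-4})^{1/2} ,$$
$$e_z = -2\nu/(1 + 3\beta^{-4})^{1/2} . \tag{12}$$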
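The fully elastic solution and the critical pressure are simple enough to evaluate directly. The following is a minimal sketch in plain Python of equations (10) and (11), with the Mises effective stress of equation (5) added as a check; the function names and the sample values $\alpha = 1/3$, $\nu = 0.3$ are illustrative assumptions.

```python
import math

def critical_pressure(alpha):
    """p* of eq. (11): pressure at which yielding starts at the inner surface."""
    return (1.0 - alpha**2) / math.sqrt(3.0 + alpha**4)

def elastic_fields(p, alpha, xi, nu):
    """Dimensionless stresses and strains of eq. (10) at radius xi = r/b."""
    c = p / (1.0 / alpha**2 - 1.0)
    inv_xi2 = 1.0 / xi**2
    S_r = c * (1.0 - inv_xi2)
    S_t = c * (1.0 + inv_xi2)
    return {
        "S_r": S_r,
        "S_theta": S_t,
        # Mises effective stress, eq. (5), in dimensionless form
        "S": math.sqrt(S_r**2 - S_r * S_t + S_t**2),
        "e_r": c * ((1.0 - nu) - (1.0 + nu) * inv_xi2),
        "e_theta": c * ((1.0 - nu) + (1.0 + nu) * inv_xi2),
        "e_z": -2.0 * c * nu,
    }

alpha, nu = 1.0 / 3.0, 0.3
p_star = critical_pressure(alpha)
inner = elastic_fields(p_star, alpha, alpha, nu)
outer = elastic_fields(p_star, alpha, 1.0, nu)
print(p_star, inner["S"], inner["S_r"], outer["S_r"])
```

At $p = p^*$ the effective stress at the inner surface equals the yield value (S = 1), the radial stress there equals $-p$, and the radial stress vanishes at the outer surface, as the boundary conditions (8) require.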

4. PLASTIC REGION ($\alpha \le \xi \le \beta$, $p^{**} \ge p \ge p^*$). Following Nadai
for isotropic problems [5], we introduce the parametric representation
($0 \le \phi \le \pi/2$)

$$S_r = -S\cos\phi/\sin(\pi/3) ,$$
$$S_\theta = -S\cos(\phi + \pi/3)/\sin(\pi/3) , \tag{13}$$

which satisfies equation (5) identically and leads to the following
equation upon substituting into the equation of equilibrium,

$$\xi^{-1}d\xi = [\sin(\pi/3)(\tan(\pi/6) + \tan\phi)]^{-1}(\tan\phi\, d\phi - S^{-1}dS) . \tag{14}$$

By the extended Mitchell theorem [6], the stress solution for the
present problem is independent of $\nu$. So choose $\nu = 1/2$ and then
equations (3), (4), (6) and (9) lead to

$$e_r = (S_r - S_\theta/2)S^{n-1} ,$$
$$e_\theta = (S_\theta - S_r/2)S^{n-1} . \tag{15}$$

The compatibility equation follows from (2) and (9) as

$$e_r = (\partial/\partial\xi)(\xi e_\theta) . \tag{16}$$

Substituting (15) into (16) with the aid of (13), we can obtain

$$\xi^{-1}d\xi = [-\sin(\pi/3)(\cot(\pi/6) + \cot\phi)]^{-1}(\cot\phi\, d\phi + nS^{-1}dS) . \tag{17}$$

Combining (14) and (17) yields

$$S^{-1}dS = \frac{\tan\phi + \tan(\pi/6)}{1 - n\tan\phi\tan(\pi/6)}\, d\phi , \tag{18}$$

which can be integrated with the known condition at the elastic-
plastic boundary. Since S and $\phi$ are functions of $\xi$ and $\beta$, the
notation $S_{\xi\beta} = S(\xi,\beta)$, $\phi_{\xi\beta} = \phi(\xi,\beta)$ is introduced [2]. After
some manipulation, the relation between $S_{\xi\beta}$ and $\phi_{\xi\beta}$ is given by

$$S_{\xi\beta} = \left[\frac{n\sin\phi_{\beta\beta} - \sqrt{3}\cos\phi_{\beta\beta}}{n\sin\phi_{\xi\beta} - \sqrt{3}\cos\phi_{\xi\beta}}\right]^{\gamma} \exp\left[\frac{(n-1)\sqrt{3}}{n^2 + 3}(\phi_{\beta\beta} - \phi_{\xi\beta})\right] , \tag{19}$$

where $\gamma = (n + 3)/(n^2 + 3)$, and

$$\tan\phi_{\beta\beta} = (\beta^2/\sqrt{3} + \sqrt{3})/(1 - \beta^2) \tag{20}$$

follows from (12) and (13) at $\xi = \beta$.
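Substituting (18) into (14) and carrying out the integration with
the known condition at the elastic-plastic boundary, we have

$$(\beta/\xi)^2 = F(\phi_{\xi\beta})$$

and

$$F(\phi_{\xi\beta}) = \frac{\sin(\phi_{\xi\beta} + \pi/6)}{\sin(\phi_{\beta\beta} + \pi/6)} \left[\frac{n\sin\phi_{\beta\beta} - \sqrt{3}\cos\phi_{\beta\beta}}{n\sin\phi_{\xi\beta} - \sqrt{3}\cos\phi_{\xi\beta}}\right]^{4n/(n^2+3)} \exp\left[\frac{(n^2 - 1)\sqrt{3}}{n^2 + 3}(\phi_{\beta\beta} - \phi_{\xi\beta})\right] , \tag{21}$$

from which $\phi_{\xi\beta}$ can be solved as a function of $\xi$ and $\beta$. At the inside
surface, $\xi = \alpha$, $\phi = \phi_{\alpha\beta}$, thus the expression relating $\alpha$, $\beta$ and p can
be written parametrically as

$$p = S_{\alpha\beta}\cos\phi_{\alpha\beta}/\sin(\pi/3) ,$$
$$(\beta/\alpha)^2 = F(\phi_{\alpha\beta}) , \tag{22}$$

where $S_{\alpha\beta}$ and $F(\phi_{\alpha\beta})$ are given by (19) and (21), respectively. By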

examining  (19)  and  (21),  it  can  be  found  that  P.  .. 

4’a6-^<t>o  =  tan-l  (A'n)  for  finite  n.  It  should  be  noted  that  for  the 

present  problem  we  always  have  4)a3l‘l’53l‘f’33  'f*ol.‘f’a3l‘^’otal‘f'3B^*^l  1  ~  ^  ^ 

Now we have completed the stress solution which is given by (13),
(19), (20), (21) and (22).
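The stress solution can be evaluated with a few lines of code. The sketch below, in plain Python, implements (13) and (19)-(21) as reconstructed above and finds $\phi_{\xi\beta}$ by bisection; the bisection, the function names and the sample values n = 9, $\beta = 0.6$, $\xi = 0.5$ are assumptions made for illustration, not the author's numerical procedure. It relies on F decreasing monotonically from large values near $\phi_0 = \tan^{-1}(\sqrt{3}/n)$ down to 1 at $\phi_{\beta\beta}$.

```python
import math

SQRT3 = math.sqrt(3.0)

def phi_boundary(beta):
    """phi at the elastic-plastic boundary, eq. (20)."""
    return math.atan2(3.0 + beta**2, SQRT3 * (1.0 - beta**2))

def _d(phi, n):
    """Recurring factor n sin(phi) - sqrt(3) cos(phi)."""
    return n * math.sin(phi) - SQRT3 * math.cos(phi)

def S_of_phi(phi, beta, n):
    """Effective stress S as a function of phi, eq. (19)."""
    pb = phi_boundary(beta)
    gamma = (n + 3.0) / (n**2 + 3.0)
    return ((_d(pb, n) / _d(phi, n)) ** gamma
            * math.exp(SQRT3 * (n - 1.0) / (n**2 + 3.0) * (pb - phi)))

def F_of_phi(phi, beta, n):
    """Right-hand side of (beta/xi)^2 = F(phi), eq. (21)."""
    pb = phi_boundary(beta)
    return (math.sin(phi + math.pi / 6.0) / math.sin(pb + math.pi / 6.0)
            * (_d(pb, n) / _d(phi, n)) ** (4.0 * n / (n**2 + 3.0))
            * math.exp(SQRT3 * (n**2 - 1.0) / (n**2 + 3.0) * (pb - phi)))

def solve_phi(xi, beta, n, iters=200):
    """Bisection for phi in (phi_0, phi_boundary) with F(phi) = (beta/xi)^2."""
    target = (beta / xi) ** 2
    lo, hi = math.atan(SQRT3 / n), phi_boundary(beta)
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if F_of_phi(mid, beta, n) > target:
            lo = mid  # F too large: phi must increase
        else:
            hi = mid
    return 0.5 * (lo + hi)

def plastic_stresses(xi, beta, n):
    """Dimensionless (S_r, S_theta) in the plastic region, eq. (13)."""
    phi = solve_phi(xi, beta, n)
    S = S_of_phi(phi, beta, n)
    c = 1.0 / math.sin(math.pi / 3.0)
    return -S * math.cos(phi) * c, -S * math.cos(phi + math.pi / 3.0) * c

n, beta = 9.0, 0.6
S_r, S_t = plastic_stresses(0.5, beta, n)
print(S_r, S_t)
```

Given $\alpha$ and $\beta$, the pressure then follows from (22) as the negative of the radial stress at $\xi = \alpha$. The sketch does not guard against radii so small that $\phi$ approaches $\phi_0$, where the factor in `_d` tends to zero.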

The solution for the strains in the plastic region ($\alpha \le \xi \le \beta$, $p \ge p^*$)
of an elastic-plastic (finite n) plate can be obtained from (3), (4)
and (7), using (6), (9) and the above stress solution. After some
manipulation, the equations for the dimensionless strains can be
written as

$$e_r = -S_{\xi\beta}^n \sin(\phi_{\xi\beta} + \pi/3) - S_{\xi\beta}\cos(\phi_{\xi\beta} + \pi/3)(\tfrac{1}{2} - \nu)/\sin(\pi/3) ,$$
$$e_\theta = S_{\xi\beta}^n \sin\phi_{\xi\beta} - S_{\xi\beta}\cos\phi_{\xi\beta}(\tfrac{1}{2} - \nu)/\sin(\pi/3) ,$$
$$e_z = [S_{\xi\beta}^n - (1 - 2\nu)S_{\xi\beta}]\cos(\phi_{\xi\beta} + \pi/6) , \tag{23}$$

where $S_{\xi\beta}$ and $\phi_{\xi\beta}$ can be evaluated as functions of $\xi$ and $\beta$ by
equations (19), (20) and (21).

5. DISCUSSION OF RESULTS. Since the deformation theory is
used, the validity of the above solution should be assessed by
applying Budiansky's criterion [7] which requires the following
inequality to be satisfied.

[(ns""^-l)/(s"-‘'-l)]l/2  ^  ^24) 

For any values of n, the ranges of S and $\phi$ over which the inequality
may not be valid can be determined. In the present case, the above
inequality is satisfied except over a certain range of S and $\phi$ for
n > 17 [4].

Another limitation of the above solution is due to the small
strain assumption. In the case of annular plates with arbitrary
ratio of inner to outer radius $\alpha$, there may exist two types of
plastic flow. Full plastic flow with complete yielding may happen
for larger values of $\alpha$. In the case of a flat ring with smaller
values of $\alpha$, it is impossible to obtain complete yielding in it
through applying a pressure on its inner boundary. The outer portion
of the ring must remain strained elastically and a case of partial
plastic flow with thickening will occur. Neither full plastic flow
for larger $\alpha$ nor partial plastic flow with thickening for smaller $\alpha$
will be permitted under the assumption of small strain.

Some numerical results have been worked out for the 2219-T87
aluminum plate with geometric ratio b/a = 3. The material constants
[4] are n = 9, $\nu$ = 0.3, E = 10.5 × 10^6 psi, $\sigma_y$ = 5.5 × 10^4 psi. The effect
of $\rho/a$ on the radial and tangential stress distributions is shown
in Figures 1 and 2, respectively. The corresponding strain
distributions for the radial, tangential and axial components are
shown in Figs. 3, 4 and 5, respectively. Finally it should be noted
that the validity of the above results based on the deformation
theory has been assessed by applying Budiansky's criterion. The
range of S and $\phi$ for the above stresses and strains satisfies the
inequality (24).


AN EFFECTIVE STIFFNESS VISCOELASTIC COMPOSITE BEAM THEORY


Charles R. Thomas
Watervliet, New York 12189


ABSTRACT. Viscoelasticity in the individual beam layers is
modeled according to the standard linear model and the Timoshenko beam
theory with the resulting equations utilized in deriving a
microstructure or effective stiffness viscoelastic laminated beam theory.
A time harmonic wave propagation along the length coordinate of the
viscoelastic composite beam has been utilized to illustrate an
application of the derived theory and to point out the influence of
the various viscoelastic and geometric parameters involved.

The  first  task  in  deriving  the  viscoelastic  laminated  beam 
theory  was  to  formulate  energies  for  individual  viscoelastic  layers 
in  terms  of  the  Timoshenko  beam  theory  in  a  form  suitable  for 
developing  the  composite  theory.  A  goal  of  the  direct  derivation  of 
the  beam  theory,  instead  of  the  intermediate  step  of  developing  a 
viscoelastic  laminated  continuum  theory  which  must  then  be  reduced  to 
a  beam  theory,  was  accomplished  by  the  introduction  of  a  gross 
rotation  term  for  the  laminated  beam  into  the  derivation  of  individ¬ 
ual  layer  energy  relations.  The  final  result  was  an  energy  conser¬ 
vation  law  for  the  individual  beam  layers  in  terms  of  kinetic  energy, 
potential  energy,  and  dissipation  energy. 

The  viscoelastic  laminated  beam  is  composed  of  a  number  of 
alternating  plane,  parallel  layers  of  two  homogeneous,  isotropic 
viscoelastic  materials  which  are  respectively  termed  the  reinforcing 
layer  and  the  matrix  layer.  To  obtain  the  total  energy  for  the 
viscoelastic  composite  beam,  the  individual  layer  kinetic,  potential, 
and dissipation energies were summed over the n layer pairs of which
the composite beam was composed. The discrete system thus obtained
was then converted to a continuous system by means of a smoothing
operation, that is a replacement of the resulting energy summations
by weighted integrations over beam thickness. A reduction of one
variable from the formulation was made possible through a continuity
condition resulting from continuity of displacement across layer
interfaces. The final result of the derivational work was a set of
three flexure equations of motion and corresponding boundary
conditions for viscoelastic laminated composite standard linear model
Timoshenko beams.

Time harmonic waves of the form

were  passed  through  the  three  equations  of  motion  and  resulted  in  a 
characteristic  equation  in  terms  of  p,  the  circular  frequency;  c, 
the  phase  velocity;  a,  the  attenuation  coefficient;  and  the  numerous 
viscoelastic  and  geometric  parameters  involved. 

1.  INTRODUCTION.  A  great  deal  of  work  has  been  accomplished 
in  the  area  of  elastic  laminated  effective  stiffness  or  microstructure 
continuum  theories  and  approximate  plate  and  beam  theories.  By  the 
same  token,  little  has  been  accomplished  with  viscoelastic  counter¬ 
parts  to  these  theories. 

An  elastic  continuum  theory  which  included  effective  stiffness 
for  both  the  reinforcing  and  matrix  layers  of  a  laminated  continuum 
was  developed  by  Sun,  Achenbach,  and  Herrmann  [1,  2].  The  continuum 
theory  was  utilized  by  Thomas  [3]  to  study  the  simple  thickness 
modes for laminated media with layering both parallel and perpendicular
to the plate free surfaces. Sun [4] deduced a two dimen¬
sional  theory  for  laminated  plates  from  the  three  dimensional 
continuum  theory.  Velocity  correction  coefficients  were  introduced 
into  the  two  dimensional  theory  by  Thomas  [5]  and  flexural  and  exten- 
sional  vibrations  for  plate  strips  and  rectangular  plates  were 
studied  by  Thomas  [6,  7]  according  to  this  theory  and  compared  to 
similar  results  from  effective  modulus  plate  theories.  A  micro¬ 
structure  theory  for  an  elastic,  laminated  composite  beam  was  developed 
by  Sun  [8]  and  the  approach  utilized  in  this  paper  will  be  followed 
in  deriving  a  viscoelastic,  laminated  composite  beam  theory. 

Thomas  [9]  showed  that  the  flexure  beam  theory  in  reference  [8]  is 
directly  obtainable  through  a  simple  reduction  of  the  existing 
flexure  equations  for  composite  plates  [4,  5]. 

A  continuum  theory  for  a  viscoelastic  laminated  composite  was 
developed  by  Grot  and  Achenbach  [10],  however  the  equations  developed 
were  not  applied  to  any  problems  of  wave  propagation  or  vibration. 

It  is  certainly  theoretically  possible  to  start  with  the  equations 
in  reference  [10],  to  make  appropriate  series  expansions  and  derive  a 
plate  theory,  and  to  then  follow  reference  [9]  to  make  a  direct  reduc¬ 
tion  to  a  viscoelastic  beam  theory.  However,  for  convenience  and 
simplicity  of  analysis,  the  approach  in  the  current  report  will  be  to 
begin  with  the  viscoelastic  Timoshenko  beam  equations  and  work 
towards  a  viscoelastic  laminated  beam  equation  in  the  manner  of 
reference [8]. With somewhat guarded conclusions, Stern, Bedford,
and Yew [11] have demonstrated a definite need for an effective
stiffness type formulation for viscoelastic laminates.

The current approach to obtaining a viscoelastic laminated
beam theory will be a viscoelastic development which mirrors the
elastic development given by Sun [8]. Surprisingly, the
difficulty  is  in  obtaining  the  energies  for  a  single  layer  modeled 
as  a  viscoelastic  Timoshenko  beam.  The  most  pleasing  and  straight¬ 
forward  development  of  suitable  viscoelastic  Timoshenko  beams  results 
from  a  utilization  of  viscoelastic  constitutive  relations  of  the 
differential  form;  it  is  these  equations  which  yield  a  viscoelastic 
development  which  closely  mirrors  Sun's  [8]  elastic  derivation. 


2.  THE  ENERGY  PRINCIPLE.  As  Sun  [8]  does  in  the  development 
of  an  elastic  laminated  beam  theory,  the  first  task  in  deriving  a 
viscoelastic  laminated  beam  theory  is  to  formulate  energies  for 
individual viscoelastic layers in terms of the Timoshenko [12] beam
theory. In the past, Lee [13] developed viscoelastic Timoshenko
beam equations for viscoelastic extensional strain but the shear
strain was left elastic. Pan [14] extended the analysis to include
viscoelastic shear strains. The current objective is to develop
the viscoelastic Timoshenko beam equations in a form more suitable
to the development of a viscoelastic composite beam theory. A first
goal  will  be  the  development  of  a  single  layer  energy  principle 
suitable  for  a  direct  application  in  the  derivation  of  a  multilayer 
energy  principle. 


The development of an approximate theory such as for laminated
elastic plates was originally a two step procedure. In the
first instance, the Mindlin plate theory [15] in its first order
approximation was utilized to develop a continuum theory for laminated
composites. Then to obtain a laminated plate theory a first order
approximation is made on those variables which came from the zero
order part of the Mindlin theory as in Sun [4] and Thomas [5]; this
explanation will become clear shortly. Now in developing an elastic
laminated beam theory, Sun [8] has made both of these approximations
simultaneously to obtain a flexure theory for laminated beams.
Actually,  Thomas  [9]  has  shown  that  the  flexure  beam  theory  is 
directly  obtainable  from  the  existing  flexure  plate  theory. 


The current objective is to derive a viscoelastic laminated beam
theory directly, without first having to develop a viscoelastic laminated
continuum theory. In making the various zero and first
order  expansions  of  displacement,  terms  which  lead  to  an  extension 
theory  are  also  maintained  since  the  second  expansion  of  extensional 
displacements  leads  to  a  flexure  term.  The  first  order  displacements 
which  will  result  in  the  Timoshenko  beam  equations  [12]  for  flexure 
as well as an extensional equation for beams are

$$v(y,z,t) = \bar{v}(y,t) - z\,\phi(y,t)$$

$$w(y,z,t) = \bar{w}(y,t) - z\,\Phi(y,t). \tag{1}$$

The zero order terms in (1) are $\bar{v}$ and $\bar{w}$, and a first order expansion
of these two displacements results in the expressions

$$\bar{v}(y,t) = v_\alpha^k(y,t) - z_\alpha^k\,\psi_\alpha^k(y,t)$$

$$\bar{w}(y,t) = w_\alpha^k(y,t) - z_\alpha^k\,\Psi_\alpha^k(y,t) \tag{2}$$

where the subscript $\alpha = 1, 2$ will later denote whether a stiff or
soft  laminated  beam  layer  is  indicated  and  the  superscript  k  which 
layer  pair  is  indicated.  While  absolutely  necessary  at  this  point, 
the  notation  in  (2)  jumps  into  the  laminate  notation  while  seeming 
to  be  at  the  single  layer  stage  of  development.  See  Sun,  Achenbach, 
and  Herrmann  [1]  or  Sun  [8]  if  clarification  is  required. 


Combining equations (1) and (2) and extracting only those terms which
result in flexural motion results in the displacement relations

$$v(y,z,t) = -z_\alpha^k\,\psi_\alpha^k(y,t) - z\,\phi(y,t)$$

$$w(y,z,t) = w_\alpha^k(y,t) \tag{3}$$


where $\psi_\alpha^k(y,t)$ represents the gross rotation in the laminated beam,
$w_\alpha^k(y,t)$ represents the transverse deflection, and $\phi(y,t)$ represents
the individual layer rotation. The various displacements and
rotations on the right side of (2) represent the reduction from a
laminated continuum theory to a laminated beam theory; thus, from
continuity of displacement and rotation at laminate interfaces, it is
clear that the notation may be simplified to $w(y,t) = w_\alpha^k(y,t)$ and
$\psi(y,t) = \psi_\alpha^k(y,t)$ for $\alpha = 1, 2$ and for all values of $k$. Hence, with
these notational simplifications in mind, the final form of the first
order flexure displacement expansion is

$$v(y,z,t) = -z_\alpha^k\,\psi(y,t) - z\,\phi(y,t)$$

$$w(y,z,t) = w(y,t) \tag{4}$$


where  these  equations  are  valid  only  when  eventually  utilized  in 
developing  a  laminated  beam  theory.  Equations  (4)  may  be  reduced  to 
those for a homogeneous or single layered beam by setting $\psi(y,t) = 0$;
this  being  done,  equations  (4)  reduce  to  those  given  by  Brunelle  [16] 
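for flexure of a beam.

The non-zero strain-displacement relations are

$$\varepsilon_y = \frac{\partial v}{\partial y}, \qquad \varepsilon_{yz} = \frac{1}{2}\left(\frac{\partial v}{\partial z} + \frac{\partial w}{\partial y}\right). \tag{5}$$

The non-zero stress equations of motion which pertain to the problem
are

$$\sigma_{yz,y} = \rho\,\ddot{w}$$

$$\sigma_{y,y} + \sigma_{yz,z} = \rho\,\ddot{v}. \tag{6}$$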

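From the appendix and equations (A-17) the constitutive equations for
a special case of the standard linear model are

$$\left(1 + C\frac{\partial}{\partial t}\right)\sigma_{yz} = \left(2kG + 2k^*G^*\frac{\partial}{\partial t}\right)\varepsilon_{yz}$$

$$\left(1 + C\frac{\partial}{\partial t}\right)\sigma_{y} = \left(E + E^*\frac{\partial}{\partial t}\right)\varepsilon_{y} \tag{7}$$

where shear correction coefficients $k$ and $k^*$ have now been introduced
in a manner similar to that of Timoshenko [12] and Mindlin and
Deresiewicz [17].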

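The procedure involved in deriving the theory will be to manipulate
the left sides of equations (6) until they are of the form of the
left sides of equations (7). Thus, taking the first time derivatives
of (6) and multiplying by the viscoelastic constant $C$ results in the
equations

$$C\,\dot{\sigma}_{yz,y} = \rho C\,\dddot{w}$$

$$C\,\dot{\sigma}_{y,y} + C\,\dot{\sigma}_{yz,z} = \rho C\,\dddot{v} \tag{8}$$

which when added to their counterparts in equation (6) become

$$\sigma_{yz,y} + C\,\dot{\sigma}_{yz,y} = \rho\,\ddot{w} + \rho C\,\dddot{w}$$

$$\sigma_{y,y} + C\,\dot{\sigma}_{y,y} + \sigma_{yz,z} + C\,\dot{\sigma}_{yz,z} = \rho\,\ddot{v} + \rho C\,\dddot{v}. \tag{9}$$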


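Multiplying the first equation of (9) by $\dot{w}$ and the second equation by
$\dot{v}$, integrating over the beam volume and time, and finally adding the
final answers results in the equation

$$\int_A\int_0^\ell\int_0^t \Big[(\sigma_{yz,y} + C\,\dot{\sigma}_{yz,y})\,\dot{w} + (\sigma_{y,y} + C\,\dot{\sigma}_{y,y})\,\dot{v} + (\sigma_{yz,z} + C\,\dot{\sigma}_{yz,z})\,\dot{v}\Big]\, dA\,dy\,dt$$

$$= \int_A\int_0^\ell\int_0^t \rho\,\big(\dot{v}\,\ddot{v} + C\,\dot{v}\,\dddot{v} + \dot{w}\,\ddot{w} + C\,\dot{w}\,\dddot{w}\big)\, dA\,dy\,dt. \tag{10}$$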


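After several integrations by parts, equation (10) may be
expressed as

$$\begin{aligned}
&\int_A\int_0^t \Big[(\sigma_{yz} + C\,\dot{\sigma}_{yz})\,\dot{w} + (\sigma_{y} + C\,\dot{\sigma}_{y})\,\dot{v}\Big]_{y=0}^{y=\ell}\, dA\,dt
+ \int_A\int_0^\ell\int_0^t \frac{\partial}{\partial z}\Big[(\sigma_{yz} + C\,\dot{\sigma}_{yz})\,\dot{v}\Big]\, dA\,dy\,dt \\
&\quad - \int_A\int_0^\ell\int_0^t \Big[(\sigma_{yz} + C\,\dot{\sigma}_{yz})\Big(\frac{\partial \dot{w}}{\partial y} + \frac{\partial \dot{v}}{\partial z}\Big) + (\sigma_{y} + C\,\dot{\sigma}_{y})\,\frac{\partial \dot{v}}{\partial y}\Big]\, dA\,dy\,dt \\
&= \int_A\int_0^\ell\int_0^t \rho\,\Big[(\ddot{w} + C\,\dddot{w})\,\dot{w} + (\ddot{v} + C\,\dddot{v})\,\dot{v}\Big]\, dA\,dy\,dt.
\end{aligned} \tag{11}$$

It is immediately clear that

$$\int_A\int_0^\ell\int_0^t \frac{\partial}{\partial z}\Big[(\sigma_{yz} + C\,\dot{\sigma}_{yz})\,\dot{v}\Big]\, dA\,dy\,dt = 0 \tag{12}$$

since both beam surfaces are stress free and that

$$\int_A\int_0^t \Big[(\sigma_{yz} + C\,\dot{\sigma}_{yz})\,\dot{w} + (\sigma_{y} + C\,\dot{\sigma}_{y})\,\dot{v}\Big]_{y=0}^{y=\ell}\, dA\,dt = 0 \tag{13}$$

since the boundary terms will be satisfied at the beam ends.
Applying equations (5) and (7) to equation (11) and taking into
account equations (12) and (13) results in

$$-\int_A\int_0^\ell\int_0^t \Big[(2kG\,\varepsilon_{yz} + 2k^*G^*\,\dot{\varepsilon}_{yz})\,2\dot{\varepsilon}_{yz} + (E\,\varepsilon_{y} + E^*\,\dot{\varepsilon}_{y})\,\dot{\varepsilon}_{y}\Big]\, dA\,dy\,dt
= \int_A\int_0^\ell\int_0^t \rho\,\Big[(\ddot{w} + C\,\dddot{w})\,\dot{w} + (\ddot{v} + C\,\dddot{v})\,\dot{v}\Big]\, dA\,dy\,dt. \tag{14}$$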

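But, from the chain rule of partial differentiation it is clear that

$$\frac{d}{dt}\left[e^2\right] = e\,\dot{e} + \dot{e}\,e \tag{15}$$

or that

$$e\,\dot{e} = \frac{1}{2}\frac{d}{dt}\left[e^2\right]. \tag{16}$$

Similarly, the fact that an indefinite integral can be defined as a
definite integral with a variable upper limit

$$\int g(t)\,dt = \int_0^t g(\tau)\,d\tau + \text{const.} \tag{17}$$

immediately results, after taking a time derivative of both sides, in
the equation

$$\frac{d}{dt}\int_0^t g(\tau)\,d\tau = g(t) \tag{18}$$

which for $g(t) = \dot{e}^2$ results in the relationship

$$\dot{e}^2 = \frac{d}{dt}\int_0^t \dot{e}^2\,d\tau. \tag{19}$$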

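A direct application of relations (16) and (19) to equation (14) with
an introduction of equations (4) and (5) results in the equation

$$\begin{aligned}
&\int_0^\ell\int_0^t \frac{d}{dt}\bigg[\tfrac{1}{2}AkG\Big(\frac{\partial w}{\partial y} - \phi\Big)^2 + Ak^*G^*\int_0^t\Big(\frac{\partial \dot{w}}{\partial y} - \dot{\phi}\Big)^2 d\tau
+ \tfrac{1}{2}EI\Big(\frac{\partial \phi}{\partial y}\Big)^2 + E^*I\int_0^t\Big(\frac{\partial \dot{\phi}}{\partial y}\Big)^2 d\tau \\
&\qquad + \tfrac{1}{2}AE\,(z_\alpha^k)^2\Big(\frac{\partial \psi}{\partial y}\Big)^2 + AE^*(z_\alpha^k)^2\int_0^t\Big(\frac{\partial \dot{\psi}}{\partial y}\Big)^2 d\tau\bigg]\, dy\,dt \\
&+ \int_0^\ell\int_0^t \frac{\rho}{2}\frac{d}{dt}\bigg[A\dot{w}^2 - 2AC\int_0^t \ddot{w}^2\,d\tau + A(z_\alpha^k)^2\dot{\psi}^2 - 2AC\int_0^t (z_\alpha^k)^2\ddot{\psi}^2\,d\tau
+ I\dot{\phi}^2 - 2IC\int_0^t \ddot{\phi}^2\,d\tau\bigg]\, dy\,dt = 0
\end{aligned} \tag{20}$$

after an integration over the beam area where

$$A = bd, \qquad I = \frac{bd^3}{12} \tag{21}$$

with $b$ being the beam width and $d$ being the beam thickness.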

Following  Anderson  [18],  a  conservation  law  is  sought  in  the 
existence  of  a  quantity  H  such  that 


H  =  constant. 


such  that  obviously 


where 


H = T + U + V  (24)

with  the  quantities  T,  U,  and  V  being  called  the  kinetic  energy,  the 
potential  energy,  and  the  dissipation  energy.  From  a  comparison  of 

equations  (20),  (23),  and  (24)  it  is  clear  that  the  various  energies 
may  be  defined  as 

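$$T = \iint T^*\,dy\,dt$$

and from equation (20) it is clear that the energies are

$$T^* = \frac{\rho}{2}\Big[A\dot{w}^2 + A(z_\alpha^k)^2\dot{\psi}^2 + I\dot{\phi}^2\Big]$$

$$U^* = \frac{1}{2}\Big[AkG\Big(\frac{\partial w}{\partial y} - \phi\Big)^2 + EI\Big(\frac{\partial \phi}{\partial y}\Big)^2 + AE\,(z_\alpha^k)^2\Big(\frac{\partial \psi}{\partial y}\Big)^2\Big]$$

$$V^* = \int_0^t \Big[Ak^*G^*\Big(\frac{\partial \dot{w}}{\partial y} - \dot{\phi}\Big)^2 + E^*I\Big(\frac{\partial \dot{\phi}}{\partial y}\Big)^2 + AE^*(z_\alpha^k)^2\Big(\frac{\partial \dot{\psi}}{\partial y}\Big)^2 - \rho AC\,\ddot{w}^2 - \rho AC\,(z_\alpha^k)^2\ddot{\psi}^2 - \rho IC\,\ddot{\phi}^2\Big]\, d\tau \tag{26}$$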


3. THE LAMINATED BEAM THEORY. The laminated beam, Figure 1, is
composed of a number of alternating plane, parallel layers of two
homogeneous, isotropic viscoelastic materials which are respectively
termed the reinforcing layer and the matrix layer. The reinforcing
layer is the stiffer of the two layer combination and is indicated
by the subscript "1" while the softer matrix layer is indicated by the
subscript "2". The elastic constants, the viscoelastic constants, the
layer density, and the thickness for the reinforcing and matrix layers
respectively are $E_1$, $G_1$, $E_1^*$, $G_1^*$, $C_1$, $\rho_1$, $d_1$, and $E_2$, $G_2$, $E_2^*$, $G_2^*$, $C_2$,
$\rho_2$, $d_2$.

The basic variables involved are $w$, the transverse deflection;
$\psi$, the gross rotation; $\phi_1$, the rotation of the stiff layer; and
$\phi_2$, the rotation of the soft layer. The midplane positions for the kth
pair of neighboring reinforcing and matrix layers are $z_1^k$ and $z_2^k$
respectively, Figure 1, with the layer midplanes taken perpendicular to
the z-axis. The width of the beam is $b$ and the total or gross thickness is $h$.

From  equation  (26),  the  kinetic,  potential,  and  dissipative 
energies  in  the  individual  layers  are 


•‘fc  =  £^A,w2  .  A„(zi)V  -  Ia'I 

,  A^,k,2(|t,2  . 


where  a  =  1,  2  respectively  gives  the  reinforcing  and  matrix  layer 
energies. 

Now,  the  three  energies  are  summed  over  the  n  layer  pairs  to 
determine  the  total  energies  for  the  composite  beam 


(28) 


It is now convenient to convert the discrete system (28) to a continuous
system by utilization of a smoothing operation, that is, to
replace the summations in (28) by weighted integrations over the
thickness variable z.

The result of the smoothing operation is the energies

*  1  *  * 

^  “  .h/4  (w 

*  1  *  * 

U  "  /  7TT7T  ^'^1  ^ 

-h/2  (di+d2)  '  2 


T 


,  X  X 

(V,  +  VJ  dz 


where  after  smoothing 


-h/2  (^1+^2)  1  2 


z  = 

1  ^2' 


(29) 


(30) 


Carrying  out  the  integrations  in  (29)  in  terms  of  (27)  and 
taking  into  account  (30)  results  in  the  energies 

^  *  f2''2)  15^  +  P2^2>  ♦' 


*  ’  „  I  h 
2  ‘’I’l 


_ i  2  +  ,  h  -2 

(dl+dj)  *1  2  ‘’2'2  (d,+dj  ♦2 


*  1 
U  =rA,k,G 


2  .  1 


"  ^*1^1  *  *2^2)  Td^i)'  ^  i  E,I,  ^  (^)2 


t  i 

2  2  2'3y  ^ 
/:


A,  ^*4  ^  tw 

+  +  A2E2)  -(d^THp-  (if)  ^1^1  (d^+d2) 

0 


+  E2I2 


h  ,9‘I’2‘2 


(ci,+d2) 


(j-p^)  -  (PiA^C^  +  P2A2C2)  (d^+d2) 


“2 

w 


dT. 

(31) 


-  T2^PiAiC^  +  P2A2C2)  Xd^+ap"^  ■  Id^+dp’^1 

■  Xd^  ^2 

At this point, continuity of displacement at the interface of
the  kth  pair  of  layers  must  be  considered.  Applying  equation  (4)  to 
a  multilayer  beam  results  in  the  equation 

(32) 


VQ^(y,z,t)  =  -z^  (y.t)  -z(t)^  (y.t) 


Z,t  -  -^2 


and  with  the  aid  of  Figure  2  it  is  clear  that 

k  ‘^1 

Vi  =  -z-f’l'  *  *  ^2 

at  the  interface  between  layers  1  and  2.  It  is  also  clear  from 
Figure  2  that 

Z2'  =  -  f(d,.d2) 

and  that  equations  (33)  describe  the  same  interface  such  that 

Vi  =  V2  . 


(33) 


(34) 


(35) 


From equations (35) applied to equations (33) it is clear that the
continuity condition is

$$\psi = \eta\,\phi_1 + (1-\eta)\,\phi_2 \tag{36}$$

where

$$\eta = \frac{d_1}{d_1+d_2}, \qquad (1-\eta) = \frac{d_2}{d_1+d_2}. \tag{37}$$

Following Sun [8], the variable $\phi_2$ is eliminated such that

$$\phi_2 = \frac{\psi}{(1-\eta)} - \frac{\eta\,\phi}{(1-\eta)} \tag{38}$$

where for convenience the notation $\phi = \phi_1$ has been introduced.
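
The continuity condition and the elimination of the soft-layer rotation can be checked numerically with a short sketch. It assumes the reading η = d1/(d1 + d2) of equation (37); the function names and the sample thicknesses and rotations are illustrative assumptions, not values from the paper.

```python
def thickness_fraction(d1, d2):
    """eta = d1 / (d1 + d2): share of a layer pair taken by the stiff layer
    (assumed reading of equation (37))."""
    return d1 / (d1 + d2)


def eliminate_phi2(psi, phi1, d1, d2):
    """Solve psi = eta*phi1 + (1 - eta)*phi2 for phi2, as in equation (38)."""
    eta = thickness_fraction(d1, d2)
    return (psi - eta * phi1) / (1.0 - eta)


# Hypothetical sample values, chosen only to exercise the formulas.
d1, d2 = 1.0, 3.0
phi1, phi2 = 0.02, -0.01
eta = thickness_fraction(d1, d2)
psi = eta * phi1 + (1.0 - eta) * phi2      # continuity condition (36)
recovered = eliminate_phi2(psi, phi1, d1, d2)
```

Here `recovered` should reproduce `phi2` up to rounding, which is the sense in which (38) removes one of the four variables.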

Expression  (38)  is  directly  substituted  into  equations  (31)  and 
the  dimensionless  variable 


is  introduced  to  yield  the  energy  expressions 

2  .  1 


(39) 


T*  =  ^(p/i  +  P2A2)  w  + 

•  "2 

+  ^Plil^^  +  -  TiV^ 

u*  =  |cAik^Gi(^  -<t>)^  +  ■  rr^ 

+  ^nEl  +  (l-ri)E2)  1  ^1^1  ^9y^ 

.  lj,P  T  /  1  M  _  _JL_li)2 

^  2^^2^2^rTO  sy  (1-n) 


'5A,k*G*(|^  4)^  +  5A2k2G’(M  - 

-  l[,[nP]C,  +  (l-n)P2C2]i^  -  EPiIi^i  ? 
■  ■  (l-p)'*'’ 


dT 

(40) 


where 


.  bh3 
k  =  T7 


(41)

Now, all the squares of the various sums in equations (40) are
expanded out to yield the final forms of the energy expressions as


where the constants $a_i$ are

^1  ^1*^1^1 

^2  ~ 


33  =  -  A2k2G2/(l-n) 

®4  "  Pl^l  ■*■  ^2^2 

ag  =  +  (l-n)E2]  +  ^f::^ 

®6  °  =  a^/d-n) 

’7  '  (tV  ^2'2 

“8  °  (i^  ''2''282  '  nSj 
I.  Pp^Z


n 


10  =  XT^  ^2^2 


3i  1  “  Et  In  '*’ 


E„I 


11  "  ■^n  (l-n)2  2*2 

r-2 

3n  o  ~  Ai  ki  Gt 


12  -  "rn  •  XT^ 

^2 

®13  "  Pl^l  XT^^2^2 


(43) 


which  corresponds  to  the  elastic  constants  given  by  Sun  [8]  for 
elastic laminated beams and where the constants $b_i$ are

bi  =  A-jk-jG^  +  f^2^2^2 


*  * 


^2  ~  A2k2^2/ ( ^ 

^"3  =  ^''I'^l  ■  J^'^2^2^2 


•^4  Pi  ^1^1  ■'■  P2^2^2 

^b  *  n  *1  ^^12 

bj  =  AnE,  *  (I-njEj]  *  ^ 


be  =  A2k2G2/(l“Ti)  =  b2/(l-n) 


n  * 


*  * 


(l-n) 


2  ^2^262  ~  ^^6 


bg  =  -^np^C^  +  (l-ri)p2C2]  +  (l^n)^  P2^2^2 
(44)


*  2  * 
b,,  =  E,I,  +  — n  ^  E  I 
"  "  0^2  2 

I  n  I  n2  ★  it 

^  ^  (7-n)2 

'’'3  =  ^>'l'=l  "^f’2'2'2 

f°  y’'=^':oelastic  contribution  of  the  current 

!:»i£|p|“ 

it  conjunction  with  equations  (42) 

It  IS  easy  to  form  energy  princiole  (22)  that  ic  /^u//^4-  n  u-  ^  ^ 

+  b-^  +  b.  w 
L  39y  4 

■  ’20  -  =5  J  *  ^6*  *  ^7  J  -  %*  ^ 

*  •  ^0*  -  *>21?  -  1=5^  *  bgj,  t  Jtjj, 

b/-  '’lO^' 

p  Iw  .  a^w,  .2.  1 

’33.  ^  ^7^  -  a/ -  a,,?  - 

*  4*'^o**  ^  ^12y  a, 3$  -  bjl^  e  b^0  .  .  b,^'  .j,, 

■'"3  ^‘’12**^, 3* 


dtdy 


dtdy 
+  /Sw
0 


lb* 


+ 

0 


+  hfv  ■  M 


3w 


lay  '2'"  -3^  19y 


M.  ai^+h-^  h-^ 
®53y  ■  73y  ‘^Say  ^7 ay 


dt 


dt 


0  a 

73y  ■  '"7  ^  ■  ‘'n3y 


M  ^  L,  ^  .  h  34. 

-  a_-;rr  +  a,,-  -  b-,  ttt  +  bnTrr 


dt  =  0.  (45) 


The  viscoelastic  equations  of  motion  and  boundary  conditions  for 
laminated  beams  are  now  obtained  by  applying  the  first  lemma  of  the 
calculus  of  variations  to  equation  (45).  Thus,  the  three  equations 
of  motion  are 


2: 
ay 
2 


zf  *  ^6^  -  "6*  ■  “7^  *  "8*  ^  "zw  *  ”5^  ■  '’e* 


2 

-  b_^  +  bg4  =  ag$  -  *  ^9^^  ‘  ^10^ 


3y 


*  ’8*  ^  =1 V  ■ 

2 , 

*  ”110  ■  ‘’iz*  “  ■  “in^  *  ’13*  ■  ‘’lo'i'  ”13* 


(46) 


and  the  corresponding  boundary  conditions  are 

”1^  -  ”z*  ■  *  "ily  ■  ‘’2*  ■  ”3 


or 


w  =  d  on  y  =  O.t 


(47-a) 
ar--^  -  a  +  h  K  ^  -  n


i|;  =  0 


y  =  0,£ 


a  -  a  +  h  -  h  -  n 

^7ay  ®n3y  ^  ‘"yay  hl^" 


(47-b) 


(f)  =  0 


y  =  o,i 


(47-c) 


4. WAVE PROPAGATION. Following Sun [8], but with a viscoelastic
counterpart, assume flexural wave propagation in the y-direction
of the form

$$w = hW\,e^{-\alpha y}\,e^{ip(y/c - t)}$$

$$\psi = \Psi\,e^{-\alpha y}\,e^{ip(y/c - t)}$$

$$\phi = \Phi\,e^{-\alpha y}\,e^{ip(y/c - t)} \tag{48}$$

where $\alpha$ is the attenuation coefficient, $p$ is the circular frequency
and $c$ is the phase velocity. It is also convenient at this time to
introduce some additional relationships as

$$p = 2\pi\nu, \qquad \lambda = \frac{2\pi c}{p} = c\tau, \qquad K = \frac{2\pi}{\lambda}, \qquad \beta = \lambda\alpha \tag{49}$$

where $\nu$ is the frequency, $\lambda$ is the wave length, $\tau$ is the period, $K$
is the wave number and $\beta$ is the attenuation constant.
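
The relationships among circular frequency, phase velocity, wave length and attenuation constant can be collected in a few lines of Python. This is only a sketch: the function names are hypothetical, the wave number is taken as 2π/λ, and the decay factor assumes that the amplitude falls off as exp(-αy), so that β = λα is the logarithmic decay over one wave length.

```python
import math


def wave_relations(p, c, alpha):
    """Derived wave quantities from circular frequency p, phase velocity c
    and attenuation coefficient alpha."""
    nu = p / (2.0 * math.pi)          # frequency
    tau = 1.0 / nu                    # period
    lam = 2.0 * math.pi * c / p       # wave length, equal to c * tau
    K = 2.0 * math.pi / lam           # wave number (assumed definition)
    beta = lam * alpha                # attenuation constant
    return {"nu": nu, "tau": tau, "lam": lam, "K": K, "beta": beta}


def amplitude_ratio_per_wavelength(beta):
    """Amplitude ratio across one wave length, assuming exp(-alpha*y) decay."""
    return math.exp(-beta)


r = wave_relations(p=2.0 * math.pi, c=3.0, alpha=0.1)
ratio = amplitude_ratio_per_wavelength(r["beta"])
```

With these sample inputs the wave length equals the phase velocity times the period, which is the consistency check built into (49).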
Now, equations (48) are passed through differential equations
(46) to obtain the characteristic equations for wave propagation.
At the same time, the following dimensionless parametric, elastic,
and viscoelastic variables are introduced

(50)


The final form of the characteristic equation for viscoelastic wave
propagation is

$$\begin{vmatrix}
R_{11} + iI_{11} & R_{12} + iI_{12} & R_{13} + iI_{13} \\
R_{12} + iI_{12} & R_{22} + iI_{22} & R_{23} + iI_{23} \\
R_{13} + iI_{13} & R_{23} + iI_{23} & R_{33} + iI_{33}
\end{vmatrix} = 0 \tag{51}$$

where


Rit  =  -an  3^  +  bn  3  -  +  dn 


Ri2  = 

R21 

—  - 

■  bi2  ^  ^12 

II 

ro 

**31 

=  . 

-  b^3  3  +  di3 

R22  = 

®22 

3^ 

^22  f>  ■'■ 

■^22 

'*23  " 

•*32 

=  ' 

2 

-  *3^23  ^  ^?3  ^ 

■  C23V 

II 

CO 

CO 

ai 

®33 

“  ^33  B  +  C33V^ 

■  ^33 

*11  = 

^11 

3^ 

+  Bn  3  +  CnV^  ■ 

-  Dll 

II 

CVJ 

1— 1 

*21 

= 

Bi2  B  +  0^2 

*13  " 

*31 

Bi3  3  +  D^3 

*22  "  ■  ^22  ^  “  ®22  ^  “  ^22^^  ^  ^2Z 
*23  ^  ^32  "  ^23  ^  ^  hz  ^  ^2^  ~  ^23 

*33  =  -  ^33  -  ^33  ^  -  ^33^^^  ^  '*33 


The  constants  introduced  into  equation  (52)  are  defined  as 

a„K 

.  _2_2. 


n 


'll 


Ct  If  Y 

rr 


★  * 


4ir  a2k*Y2 


(52) 
n


11 


2 

4tt  So] 

I2~ 


2 

4Tr  a^k^Y 


2 
2

4Tr  02  k2 


12 


12 


13 


02  k2 
(l-n)X 

-  ,  *  * 

27702  k2Y2 

(l-ri)X 
_  S'lr'^Gai^T'j  Stt  012^2


D^l  -  TTb^^ 


263 


B  .  12 

'2  l; 


O'! 2  ~  2TTb'j2 


R  -  13 

^13  - 


D]3  =  2rt,3 


Bjb  ■  ^™22 


-  _  8itBe|^(l~n)c2 


p  4r  ic  if 

Atj  €2^2  “2*^2'^2 
A  -  ^23
23


'33 


^33  ^ 


2 

4iT6^ei  4irr) 

~Y~*Ju^ 

3  —  3  2  — 

Sir  0e-|Ci  8it  n  ^2^2 


33 


4iT^6’^e^  4it  ^^{2^2 


....  n\kp- 


(53) 


A numerical solution to characteristic equation (51) is possible
if it is recast as the following function

$$f(\beta, V) = \mathrm{ABS}\left(\mathrm{DET}\,\left|R_{ij} + iI_{ij}\right|\right). \tag{54}$$

Using a numerical technique such as the Rosenbrock [21] optimization
procedure, a solution is obtained when

$$f(\beta, V) = 0. \tag{55}$$
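
The idea of driving the modulus of a complex determinant to zero can be sketched in Python. Everything specific here is an assumption made for illustration: the 3 by 3 symmetric matrix `toy_matrix` is a hypothetical stand-in for the entries $R_{ij} + iI_{ij}$ (which depend on the beam constants), and a plain compass (pattern) search replaces the Rosenbrock procedure named in the text.

```python
def det3(m):
    """Determinant of a 3x3 matrix (entries may be complex) by cofactor expansion."""
    (a, b, c), (d, e, g), (h, k, l) = m
    return a * (e * l - g * k) - b * (d * l - g * h) + c * (d * k - e * h)


def toy_matrix(beta, V):
    """Hypothetical symmetric complex matrix; its determinant vanishes
    at beta = 0.5, V = 2.01 by construction."""
    z = complex(V - 2.0, beta - 0.5)
    return [[z, 0.1, 0.0],
            [0.1, 1.0, 0.0],
            [0.0, 0.0, 1.0]]


def objective(beta, V):
    """f(beta, V) = |det(R + iI)|, the quantity to be driven to zero."""
    return abs(det3(toy_matrix(beta, V)))


def compass_search(func, x0, step=0.5, tol=1e-10, max_iter=100000):
    """Derivative-free search: try +/- step along each axis, halve the step on failure."""
    x = list(x0)
    fx = func(*x)
    iterations = 0
    while step > tol and iterations < max_iter:
        iterations += 1
        improved = False
        for j in range(len(x)):
            for sign in (1.0, -1.0):
                trial = x[:]
                trial[j] += sign * step
                ft = func(*trial)
                if ft < fx:
                    x, fx = trial, ft
                    improved = True
                    break
        if not improved:
            step *= 0.5
    return x, fx


(beta_root, V_root), f_min = compass_search(objective, (0.0, 1.0))
```

Because the objective is non-negative and vanishes only at a root of the determinant, any minimizer with `f_min` close to zero is an approximate solution of the characteristic equation; a different search method would only change how that minimizer is reached.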
5. SUMMARY. An energy principle has been formulated for
viscoelastic  Timoshenko  beams  according  to  the  standard  linear 
model  with  the  stipulation,  and  hence  additional  terms,  that  the 
energy  principle  be  utilized  in  building  a  viscoelastic  laminated 
beam  theory.  The  Timoshenko  model  considered  has  accounted  for 
both  viscoelastic  extensional  and  viscoelastic  shear  strains.  To 
later incorporate the single layer energy principle into the development
of a laminated beam theory, a term which accounts for the beam's
gross  rotation  was  included  in  the  single  layer  development. 

Using the single layer energies developed, a viscoelastic laminated
beam theory composed of a number of alternating, plane,
parallel  layers  of  two  homogeneous,  isotropic  viscoelastic  materials, 
termed  the  reinforcing  layer  and  the  matrix  layer,  was  derived.  In 
deriving  the  theory,  the  individual  layer  kinetic,  potential,  and 
dissipative  energies  were  summed  over  n  layer  pairs  to  obtain  the 
total  energy  of  the  composite  beam;  these  results  are  converted  to  a 
continuous  system  by  utilization  of  a  smoothing  operation  or  weighted 
integration.  The  number  of  independent  variables  in  the  total  composite 
beam energies is reduced from four to three through the introduction of
a  condition  for  continuity  at  layer  interfaces.  A  direct  application 
of  the  energy  principle  developed  to  the  composite  beam  energies 
results  in  a  set  of  three  equations  of  motion  and  their  corresponding 
boundary  conditions  for  viscoelastic,  laminated  composite  beams. 

Flexural  wave  propagation  has  been  considered  by  passing 
viscoelastic  harmonic  waves  through  the  derived  equations  of  motion. 
Numerical  solutions  are  possible  by  applying  the  Rosenbrock 
optimization  procedure  to  the  resulting  characteristic  equation.  A 
lack  of  computation  funds  precludes  the  presentation  of  numerical 
results at the present writing.

APPENDIX. The present objective is to derive a set of constitutive
relations which can be utilized in conjunction with the basic
equations  for  a  Timoshenko  beam.  While  constitutive  equations  may 
be  formulated  in  either  integral  or  differential  form,  preliminary 
work  in  the  direction  of  formulation  of  a  viscoelastic  beam  theory 
for  laminated  composite  materials  indicates  that  the  differential 
form  of  constitutive  relations  will  be  most  useful.  The  differential 
constitutive  relations  will  be  utilized  in  the  present  development. 


The general form of the differential constitutive equations is
adapted from Fung [20] where the stress-strain relations are of the
form

$$P_1(D)\,\sigma'_{ij} = Q_1(D)\,e'_{ij}$$

$$P_2(D)\,\sigma_{kk} = Q_2(D)\,e_{kk} \tag{A-1}$$

where $P_i(D)$ and $Q_i(D)$ are given by

$$P_1(D) = \sum_{k=0}^{n_1} a_k D^k, \qquad P_2(D) = \sum_{k=0}^{n_2} c_k D^k,$$

$$Q_1(D) = \sum_{k=0}^{m_1} b_k D^k, \qquad Q_2(D) = \sum_{k=0}^{m_2} d_k D^k \tag{A-2}$$

with $D$ being the time-derivative operator of the form

$$D^k f = \frac{\partial^k f}{\partial t^k} \tag{A-3}$$

and where $\sigma'_{ij}$ and $e'_{ij}$ are the components of the stress and strain
deviators

$$\sigma'_{ij} = \sigma_{ij} - \tfrac{1}{3}\,\delta_{ij}\,\sigma_{kk}$$

$$e'_{ij} = e_{ij} - \tfrac{1}{3}\,\delta_{ij}\,e_{kk} \tag{A-4}$$

in which $\sigma_{ij}$ and $e_{ij}$ are the components of stress and strain.

Now,  assume  equations  (A-1)  to  have  the  form  of  the  standard 
linear  model 


(1  (B.cA)e 


(A-5) 


where $\sigma$ is stress and $e$ is strain. Comparing the form of (A-5) with
equations (A-1) it is clear that to have the form of the standard
linear model it must be true that

$$n_1 = m_1 = n_2 = m_2 = 1 \tag{A-6}$$

and operators (A-2) in light of (A-6) reduce to

$$P_1(D) = a_0 + a_1 D, \qquad Q_1(D) = b_0 + b_1 D,$$

$$P_2(D) = c_0 + c_1 D, \qquad Q_2(D) = d_0 + d_1 D. \tag{A-7}$$


As will be subsequently seen, the only non-zero stresses and
strains for a Timoshenko beam with its y-axis along the length and its
z-axis through the thickness are $\sigma_y$ and $\sigma_{yz}$, and $\varepsilon_y$ and $\varepsilon_{yz}$. Thus, from
equation (A-4) the non-zero stress and strain deviators are

$$\sigma'_y = \tfrac{2}{3}\,\sigma_y, \qquad \sigma'_{yz} = \sigma_{yz},$$

$$e'_y = \tfrac{2}{3}\,\varepsilon_y, \qquad e'_{yz} = \varepsilon_{yz}. \tag{A-8}$$

Now, a direct substitution of equations (A-2), (A-7), and (A-8) into
equation (A-1) results in

$$[1 + (a_1/a_0)D]\,\sigma_{yz} = [(b_0/a_0) + (b_1/a_0)D]\,\varepsilon_{yz}$$

$$[1 + (a_1/a_0)D]\,\sigma_{y} = [(b_0/a_0) + (b_1/a_0)D]\,\varepsilon_{y}$$

$$[1 + (c_1/c_0)D]\,\sigma_{y} = [(d_0/c_0) + (d_1/c_0)D]\,\varepsilon_{y}. \tag{A-9}$$

There are thus two equations for stress-strain in the y-coordinate

$$D_1\,\sigma_y = D_2\,\varepsilon_y$$

$$D_3\,\sigma_y = D_4\,\varepsilon_y, \tag{A-10}$$

where

$$D_1 = 1 + (a_1/a_0)D, \qquad D_2 = (b_0/a_0) + (b_1/a_0)D,$$

$$D_3 = 1 + (c_1/c_0)D, \qquad D_4 = (d_0/c_0) + (d_1/c_0)D, \tag{A-11}$$

and they must be combined to form a single constitutive equation

$$2D_1D_3\,\sigma_y = (D_2D_3 + D_1D_4)\,\varepsilon_y. \tag{A-12}$$

Now, from both the right and left sides of equation (A-12) it is
clear that the constitutive equation is of the form

$$(1 + \bar{a}D + \bar{b}D^2)\,\sigma_y = (1 + \bar{c}D + \bar{d}D^2)\,\varepsilon_y, \tag{A-13}$$

but it would now be desirable to have the form of the standard linear
model as in equation (A-5), if possible. This can be achieved if the
restriction is now made that

$$D_1 = D_3 = 1 + (a_1/a_0)D \tag{A-14}$$

such that equation (A-12) now becomes

$$[1 + (a_1/a_0)D]\,\sigma_y = \tfrac{1}{2}\,[(b_0/a_0 + d_0/c_0) + (b_1/a_0 + d_1/c_0)D]\,\varepsilon_y. \tag{A-15}$$

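As a final step, define the constants

$$C = a_1/a_0, \qquad 2G = b_0/a_0, \qquad 2G^* = b_1/a_0,$$

$$E = \tfrac{1}{2}\,(b_0/a_0 + d_0/c_0), \qquad E^* = \tfrac{1}{2}\,(b_1/a_0 + d_1/c_0) \tag{A-16}$$

with the final form of the constitutive equation thus being

$$\left(1 + C\frac{\partial}{\partial t}\right)\sigma_{yz} = \left(2G + 2G^*\frac{\partial}{\partial t}\right)\varepsilon_{yz}$$

$$\left(1 + C\frac{\partial}{\partial t}\right)\sigma_{y} = \left(E + E^*\frac{\partial}{\partial t}\right)\varepsilon_{y}. \tag{A-17}$$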
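
For harmonic motion the standard linear model reduces to a frequency-dependent complex modulus, which is what enters the wave propagation analysis. The Python sketch below evaluates that modulus for the extensional relation $(1 + C\,\partial/\partial t)\sigma_y = (E + E^*\,\partial/\partial t)\varepsilon_y$. The time factor exp(-ipt) is an assumption about the sign convention, and the function names and sample constants are hypothetical.

```python
def complex_modulus(E, E_star, C, p):
    """Ratio sigma/epsilon when both vary as exp(-i*p*t), so that d/dt -> -i*p.

    Follows from (1 + C d/dt) sigma = (E + E_star d/dt) epsilon.
    """
    return (E - 1j * p * E_star) / (1 - 1j * p * C)


def loss_tangent(E, E_star, C, p):
    """Magnitude of the imaginary part of the modulus relative to its real part."""
    m = complex_modulus(E, E_star, C, p)
    return abs(m.imag) / m.real


# Hypothetical material constants, for illustration only.
E, E_star, C = 2.0, 0.5, 0.1
static_modulus = complex_modulus(E, E_star, C, 0.0)    # low-frequency limit
fast_modulus = complex_modulus(E, E_star, C, 1.0e9)    # high-frequency limit
```

At zero frequency the modulus is the elastic constant E, and at very high frequency it tends to E*/C, which are the two limiting stiffnesses of the model.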


USING  FAST  TRANSFORMS  TO  COMPUTE  THE 
WEIGHT  DISTRIBUTION  OF  A  LINEAR  CODE 

Bart  F.  Rice 
Department  of  Defense 
Fort  George  G.  Meade,  Maryland 

ABSTRACT.  N.  J.  Patterson,  in  an  unpublished  note,  observed 
that  the  weight  distribution  of  a  linear  code  could  be  computed 
using  a  Fast  Hadamard  Transform.  In  this  paper,  we  expand  on 
Patterson’s  rather  brief  exposition,  providing  a  proof  that  the 
method actually produces the weight distribution and making a
comparison of the storage and time involved using Patterson's
method  and  the  "brute  force"  approach. 

The weight distribution of a linear code contains a lot of
information about the code, including its minimum distance and the
probabilities of decoding error and failure if the decoding algorithm
decodes all patterns of $\le t$ errors and nothing else (cf. [3]). It
is not surprising, therefore, that much effort has been expended
in investigating weight enumeration of linear codes. In the case
of linear binary codes, a method for computing weight distributions
involving Fast Hadamard transforms [1], described in an unpublished note by
N. J. Patterson, has certain computational advantages over the "brute
force" technique of weight enumeration (in which a basis for the
code is chosen and every possible linear combination of the basis
codewords is taken in an unimaginative way, with the weight of each
codeword recorded as the codeword is derived). In this paper we
expand on Patterson's rather brief discussion, providing a proof
that the method actually computes the weight distribution of a linear
binary code and making a comparison of this technique with the brute
force approach.

Let $A$ be a (n,k) linear code over GF(q), with "weight enumerator
polynomial"

$$W_A(x,y) = \sum_{i=0}^{n} A_i\,x^{n-i}\,y^{i},$$

where $A_i$ is the number of codewords $v \in A$ with weight $w(v) = i$. Let $A^\perp$
denote the dual of $A$. MacWilliams' Identity states that

$$|A^\perp|\,W_A(x,y) = W_{A^\perp}(x + (q-1)y,\; x - y). \tag{1}$$

If $A$ is a (n,k) linear binary code then (1) becomes

$$2^{n-k}\,W_A(x,y) = W_{A^\perp}(x+y,\; x-y). \tag{2}$$

For convenience, we will assume that $A$ is binary. The method is
quite general, though, and has obvious extensions to cases when
q>2.
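
The binary form of MacWilliams' Identity can be exercised directly: expanding $(x+y)^{n-i}(x-y)^{i}$ and collecting the coefficient of $x^{n-j}y^{j}$ gives each weight count of $A$ as a signed sum over the weight counts of the dual. The Python sketch below does this with Krawtchouk-type coefficients. The function names are illustrative, and the two test codes (the length 3 repetition code and the [7,3] simplex code) are standard examples chosen here, not taken from the paper.

```python
from math import comb


def krawtchouk(n, j, i):
    """Coefficient of x^(n-j) y^j in (x + y)^(n-i) (x - y)^i."""
    return sum((-1) ** l * comb(i, l) * comb(n - i, j - l) for l in range(j + 1))


def macwilliams(B, n):
    """Weight distribution of a binary code from that of its dual.

    B[i] is the number of dual codewords of weight i, so sum(B) is the
    size of the dual, which is the factor 2^(n-k) in identity (2).
    """
    size_dual = sum(B)
    A = []
    for j in range(n + 1):
        total = sum(B[i] * krawtchouk(n, j, i) for i in range(n + 1))
        A.append(total // size_dual)
    return A


# Dual = repetition code of length 3, so A is the even-weight code.
A_even = macwilliams([1, 0, 0, 1], 3)
# Dual = [7,3] simplex code, so A is the [7,4] Hamming code.
A_hamming = macwilliams([1, 0, 0, 0, 7, 0, 0, 0], 7)
```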

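Assume that $k > n/2$, or that $A$ has rate $k/n > \tfrac{1}{2}$. If $k/n \le \tfrac{1}{2}$,
the following procedure should be modified by interchanging $A$ and $A^\perp$
and replacing $k$ by $n-k$. Let $H$ denote an $(n-k) \times n$ parity check matrix
for $A$, say

$$H = \begin{bmatrix} v_0 \\ v_1 \\ \vdots \\ v_{n-k-1} \end{bmatrix}$$

where the rows $v_i$, $0 \le i \le n-k-1$, are vectors in $GF(2)^n$ which constitute
a basis for $A^\perp$. Write

$$H^t = \begin{bmatrix} v_0^t & v_1^t & \cdots & v_{n-k-1}^t \end{bmatrix} = \begin{bmatrix} u_0 \\ u_1 \\ \vdots \\ u_{n-1} \end{bmatrix}$$

where $u_i = (u_{i0}, u_{i1}, \ldots, u_{i,n-k-1})$ is the binary $(n-k)$-tuple
comprising the i-th row of $H^t$. Suppose $0 \le s \le 2^{n-k}-1$, say

$$s = \sum_{j=0}^{n-k-1} s_j\,2^{j}.$$

Let

$$b_s = \sum_{i=0}^{n-1} (-1)^{s \cdot u_i} = \sum_{i=0}^{n-1} (-1)^{\sum_{j=0}^{n-k-1} s_j u_{ij}}.$$

Notice that if we define $f: V = GF(2)^{n-k} \to C$ = complex numbers by

$$f(v) = \begin{cases} 1 & v = u_i \text{ for some } i,\ 0 \le i \le n-1; \\ 0 & \text{otherwise} \end{cases}$$

(with $f(v)$ taken as the number of such $i$ if several of the $u_i$ coincide), then

$$b_s = \sum_{v \in V} f(v)\,(-1)^{v \cdot s}.$$

Therefore, $b_s$ is an $(n-k)$-dimensional Hadamard transform [1] of $f$. Now, the vector

$$V(s) = \sum_{j=0}^{n-k-1} s_j\,v_j = (s \cdot u_0,\; s \cdot u_1,\; \ldots,\; s \cdot u_{n-1})$$

is a codeword which may be termed the "s-th codeword" in $A^\perp$. Clearly, as $s$ runs
through all the integers from 0 to $2^{n-k}-1$, $V(s)$ runs through all
the codewords of $A^\perp$.

We have just shown that $s \cdot u_i$ is the i-th coordinate of $V(s)$,
and thus

$$b_s = (\text{the number of 0-coordinates in } V(s)) - (\text{the number of 1-coordinates}) = n - 2w(V(s)).$$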


Hence $w(V(s)) = (n - b_s)/2$. Thus, we can compute the weight distribution
of $A^\perp$ (and subsequently, using MacWilliams' Identity, of $A$) via the

Fast Algorithm:

Step 0. Select a basis $\{v_0, v_1, \ldots, v_{n-k-1}\}$ for $A^\perp$. Let $B_i$
denote the coefficients of $W_{A^\perp}$, initialized to 0,
$0 \le i \le n$.

Step 1. Compute the "bulges" $b_s$, $0 \le s \le 2^{n-k}-1$, using a Fast Hadamard
Transform.

Step 2. For each $s$, $0 \le s \le 2^{n-k}-1$, let $i = (n - b_s)/2$ and replace $B_i$
by $1 + B_i$.

Step 3. Use the equation (MacWilliams 1963)

$$\sum_{i=0}^{n-r} \binom{n-i}{r} A_i = 2^{k-r} \sum_{i=0}^{r} \binom{n-i}{n-r} B_i, \qquad 0 \le r \le n,$$

to compute the coefficients $A_i$, $0 \le i \le n$.
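
Steps 0 through 2 of the Fast Algorithm can be sketched in Python as follows. This is an illustrative implementation, not Patterson's or the author's code: the function names are hypothetical, `f` counts columns with multiplicity so that repeated columns of $H$ are handled, and the [7,4] Hamming code is used as a test case because the weight distribution of its dual (the simplex code) is well known.

```python
def fast_hadamard(f):
    """Walsh-Hadamard transform of a list whose length is a power of two.

    Returns b with b[s] = sum over v of f[v] * (-1)^(number of common 1-bits of v and s).
    """
    a = list(f)
    h = 1
    while h < len(a):
        for start in range(0, len(a), 2 * h):
            for j in range(start, start + h):
                x, y = a[j], a[j + h]
                a[j], a[j + h] = x + y, x - y
        h *= 2
    return a


def dual_weight_distribution(H_rows, n):
    """Weight distribution B of the code spanned by the rows of H (Steps 0-2)."""
    m = len(H_rows)                       # m = n - k
    f = [0] * (1 << m)
    for i in range(n):
        # Column u_i of H packed into an integer, bit j taken from row j.
        u = sum(H_rows[j][i] << j for j in range(m))
        f[u] += 1
    b = fast_hadamard(f)                  # the "bulges" b_s
    B = [0] * (n + 1)
    for s in range(1 << m):
        B[(n - b[s]) // 2] += 1           # weight of the s-th codeword
    return B


# Parity check matrix of the [7,4] Hamming code: columns are 1..7 in binary.
hamming_H = [[(col >> j) & 1 for col in range(1, 8)] for j in range(3)]
B = dual_weight_distribution(hamming_H, 7)
```

The transform touches each of the $2^{n-k}$ entries once per butterfly stage, and there are $n-k$ stages, which matches the work factor $(n-k)2^{n-k}$ quoted for the method.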

A glaring disadvantage of this method is that all of the $2^{n-k}$
bulges $b_s$ must be saved. If not enough storage is available, then
the algorithm must be modified. The advantage is that the work factor
of the method is $(n-k)2^{n-k}$. By contrast, the brute force method
requires the computation of $(s_0, s_1, \ldots, s_{n-k-1})H$ for each of the
$2^{n-k}$ vectors $s = (s_0, s_1, \ldots, s_{n-k-1}) \in GF(2)^{n-k}$. This could be
accomplished by the following:

Brute Force Algorithm:

Step 0. Select a basis $\{v_0, v_1, \ldots, v_{n-k-1}\}$ for $A^\perp$. Let $B_i$
denote the coefficients of $W_{A^\perp}$, initialized to 0,
$0 \le i \le n$, and let $s = 0 = (0, 0, \ldots, 0)$.

Step 1. Let $i = 0$, $v = (0, 0, \ldots, 0)$.

  Step 1.1. If $s_i = 1$, replace $v$ by $v + v_i$. When q>2, this requires
  n additions. When q=2 these n additions can be accomplished
  by several mod 2 additions, the exact number
  depending on the word size of the machine used
  to implement the algorithm.

  Step 1.2. Replace $i$ by $1+i$.

  Step 1.3. If $i \le n-k-1$, go to step 1.1. Otherwise, go
  to step 2.

Step 2. Compute $a$ = weight of $v$ and replace $B_a$ by $1 + B_a$.

Step 3. Replace $s$ by $1+s$. If $s \le 2^{n-k}-1$, go to step 1. Otherwise
stop.
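
For comparison, the Brute Force Algorithm can be sketched in the same style. The names are again hypothetical. Each row of $H$ is packed into a Python integer so that the vector addition of Step 1.1 becomes a single exclusive-or, which is one way to realize the "several mod 2 additions" mentioned in that step.

```python
def brute_force_dual_weights(H_rows, n):
    """Enumerate every combination of the rows of H and tally the weights."""
    m = len(H_rows)                                   # m = n - k
    # Pack each row into an integer; bit i holds coordinate i.
    rows = [sum(bit << i for i, bit in enumerate(row)) for row in H_rows]
    B = [0] * (n + 1)
    for s in range(1 << m):                           # Step 3 loop over s
        v = 0
        for i in range(m):                            # Steps 1.1 to 1.3
            if (s >> i) & 1:
                v ^= rows[i]                          # v + v_i over GF(2)
        B[bin(v).count("1")] += 1                     # Step 2
    return B


# Same [7,4] Hamming parity check matrix as a test case.
hamming_H = [[(col >> j) & 1 for col in range(1, 8)] for j in range(3)]
B_brute = brute_force_dual_weights(hamming_H, 7)
```

Only the rows of $H$ and the tally `B` are stored, which is the storage advantage claimed for this method; the price is the inner loop over basis vectors for every $s$.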


On the average, the vectors $s$ in the Brute Force Algorithm
will have density $(n-k)/2$. Thus, the work factor for this algorithm
is $n(n-k)2^{n-k-1}$. That is, the extra cost in time is proportional to
$n$. The advantage of this method is, of course, that the only storage
required is for the arrays $\{A_i\}$, $\{B_i\}$ and $H$. If $A$ is cyclic, with
parity check polynomial $h(x)$ (of degree $k$), then (regarding a vector
in $GF(2)^n$ as a polynomial of degree $\le n-1$), we may take $v_0 = h(x)$,
$v_1 = xh(x)$, ..., $v_{n-k-1} = x^{n-k-1}h(x)$, so that only $v_0$ need be saved.
(In this case,

$$(s_0, s_1, \ldots, s_{n-k-1})H = h(x)\sum_{i=0}^{n-k-1} s_i\,x^{i}.)$$

In conclusion, using a Hadamard transform to compute the weight
distribution of a (n,k) linear code results in a time saving
proportional to $n$ at a cost in storage of approximately $2^{n-k}$ words.
The technique is particularly advantageous for high rate codes.


FACTORIAL AND HADAMARD SERIES FOR BESSEL FUNCTIONS OF ORDERS

ZERO AND ONE

Alexander  S.  Elder 
Emma  M.  Wineholt 
Propulsion  Division 
US  Army  Ballistic  Research  Laboratory 
Aberdeen  Proving  Ground,  Maryland  21005 

ABSTRACT. Bessel functions of orders zero and one for moderate and
large positive arguments have been programmed in FORTRAN using factorial
series for $J_n(x)$, $Y_n(x)$ and $K_n(x)$ and Hadamard series for $I_n(x)$. A
subroutine to calculate Stirling numbers of the first kind was developed
for  use  in  the  factorial  series.  The  recurrence  relation  was  modified 
and  the  resulting  Stirling  numbers  scaled  so  that  the  entire  range  of 
the  computer  was  utilized;  e.g.,  10  <  S  <  10  instead  of  10  < 

5  <  In  this  way,  more  terms  of  the  series  can  be  calculated  and 

higher accuracy obtained. For use in the Hadamard series, a subroutine
to calculate incomplete gamma functions was developed. Various
algorithms  were  necessary  to  encompass  the  required  range  of  arguments. 

These programs were devised to verify the accuracy (for moderate and large arguments) of our previously developed Bessel function subroutine. These programs replace the asymptotic series with convergent series, which, of course, is desirable. Extension of the program to complex arguments is now in progress.

1.  INTRODUCTION.  Factorial series derived from the Laplace integral converge rapidly for large values of the argument and thus are preferable to the corresponding asymptotic series. However, the traditional algorithm leads to very large numbers and must be modified if it is to be useful for numerical work. One procedure for scaling the large Stirling numbers which occur in the analysis is derived below.

Factorial series based on a Laplace integral evaluated between finite limits will generally diverge, so that an alternate procedure is required. One method, due to Hadamard, is to expand the Laplace integral in a series of incomplete gamma functions. The resulting series converge rapidly for large values of the argument. In practice, expansions in terms of the Kummer function are more convenient for computation. These functions are closely related to the incomplete gamma function.

Computer programs based on these algorithms will be used to check the accuracy of the BRL subroutines for Bessel functions of complex argument and integral order. This is necessary as tables are not available for a sufficient range of order and argument to make a detailed check by comparison.

2.  FACTORIAL SERIES.  The factorial series are used to calculate $K_n(x)$, $J_n(x)$ and $Y_n(x)$.

$K_n(x)$ can be expressed in terms of the Whittaker function as¹

$$K_n(x) = \left(\frac{\pi}{2x}\right)^{1/2} W_{0,n}(2x),$$

where the asymptotic expansion for the Whittaker function is²

$$W_{0,n}(2x) \sim e^{-x}\left\{1 + \sum_{m=1}^{\infty}\frac{[n^2-(-1/2)^2]\,[n^2-(-3/2)^2]\cdots[n^2-(1/2-m)^2]}{m!\,(2x)^m}\right\}.$$

This asymptotic expansion was derived from a Laplace integral evaluated between zero and infinity and involves only negative integral powers of the argument.


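For $n = 0$,

$$K_0(x) \sim \left(\frac{\pi}{2x}\right)^{1/2} e^{-x}\left[1 - \frac{1^2}{1!\,(8x)} + \frac{1^2\cdot 3^2}{2!\,(8x)^2} - \frac{1^2\cdot 3^2\cdot 5^2}{3!\,(8x)^3} + \cdots\right] = \left(\frac{\pi}{2x}\right)^{1/2} e^{-x}\sum_{j=0}^{k}\frac{A_j}{x^j}.$$

For $n = 1$,

$$K_1(x) \sim \left(\frac{\pi}{2x}\right)^{1/2} e^{-x}\left[1 + \frac{1\cdot 3}{1!\,(8x)} - \frac{1^2\cdot 3\cdot 5}{2!\,(8x)^2} + \frac{1^2\cdot 3^2\cdot 5\cdot 7}{3!\,(8x)^3} - \cdots\right] = \left(\frac{\pi}{2x}\right)^{1/2} e^{-x}\sum_{j=0}^{k}\frac{B_j}{x^j}.$$
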

A  computer  tabulation  of  the  first  fifty  of  these  coefficients  is 
shown  in  Table  I. 
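
The coefficients of these two expansions can be generated exactly from the product form of the Whittaker expansion. The sketch below is illustrative only and is not the authors' FORTRAN program; the function name is hypothetical, and it assumes that the tabulated quantities AH(I)/(I-1)! and BH(I)/(I-1)! correspond to $A_{I-1}/(I-1)!$ and $B_{I-1}/(I-1)!$.

```python
from fractions import Fraction


def scaled_asymptotic_coefficients(n, count):
    """Return [a_0/0!, a_1/1!, ..., a_{count-1}/(count-1)!] as exact fractions.

    a_m is the coefficient of x**(-m) in the asymptotic series for
    K_n(x) * (2x/pi)**0.5 * exp(x), obtained from the recurrence
        a_m = a_{m-1} * (4 n^2 - (2m - 1)^2) / (8 m),   a_0 = 1.
    Assumption (not stated in the text): the table lists a_m / m!.
    """
    mu = 4 * n * n
    coeff = Fraction(1)
    factorial = 1
    scaled = []
    for m in range(count):
        if m > 0:
            coeff *= Fraction(mu - (2 * m - 1) ** 2, 8 * m)
            factorial *= m
        scaled.append(coeff / factorial)
    return scaled


if __name__ == "__main__":
    for row, (a, b) in enumerate(zip(scaled_asymptotic_coefficients(0, 5),
                                     scaled_asymptotic_coefficients(1, 5)), start=1):
        print(row, float(a), float(b))
```

Exact rational arithmetic is used here only to avoid rounding in the comparison; a floating-point recurrence would serve equally well for a few dozen terms.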


¹ Handbook of Mathematical Functions, NBS 55, U.S. Government Printing Office, 1964, p. 377.

² Modern Analysis, E. T. Whittaker and G. N. Watson, University Press, Cambridge, England, 1927, p. 343.
  I      AH(I)/(I-1)!              BH(I)/(I-1)!

  1    0.100000000000000E 01     0.100000000000000E 01
  2   -0.125000000000000E 00     0.375000000000000E 00
  3    0.351562500000000E-01    -0.585937500000000E-01
  4   -0.122070312500000E-01     0.170898437500000E-01
  5    0.467300415039063E-02    -0.600814819335938E-02
  6   -0.189256668090820E-02     0.231313705444336E-02
  7    0.795140862464905E-03    -0.939711928367615E-03
  8   -0.342803075909615E-03     0.395542010664940E-03
  9    0.150645882968092E-03    -0.170732000697171E-03
 10   -0.671862039780535E-04     0.750904632695892E-04
 11    0.303177745450967E-04    -0.335091192340542E-04
 12   -0.138121266264335E-04     0.151275672575224E-04
 13    0.634254773036746E-05    -0.689407361996464E-05
 14   -0.293202095523644E-05     0.316658263165535E-05
 15    0.136316535482613E-05    -0.146414056629473E-05
 16   -0.636901146338206E-06     0.680825363327048E-06
 17    0.298858399233895E-06    -0.318139586281243E-06
 18   -0.140768510711813E-06     0.149299935603438E-06
 19    0.665283277862541E-07    -0.703299465168972E-07
 20   -0.315364545496475E-07     0.332411277685473E-07
 21    0.149896710531293E-07    -0.157583721327770E-07
 22   -0.714218736970249E-08     0.749058675359042E-08
 23    0.341061581781506E-08    -0.356924911166692E-08
 24   -0.163196999789118E-08     0.170450199779746E-08
 25    0.782339784145318E-09    -0.815630838789800E-09
 26   -0.375679564346582E-09     0.391013424115830E-09
 27    0.180684642541690E-09    -0.187770314798227E-09
 28   -0.870272909635814E-10     0.903113396791883E-10
 29    0.419734622392911E-10    -0.434997699570835E-10
 30   -0.202692893602046E-10     0.209804924956504E-10
 31    0.979963836984338E-11    -0.101318295010245E-10
 32   -0.474303516833861E-11     0.489854451812020E-11
 33    0.229798664344921E-11    -0.237093860038411E-11
 34   -0.111443911484997E-11     0.114872954915304E-11
 35    0.540951252872135E-12    -0.557099051465333E-12
 36   -0.262802950502473E-12     0.270420427328632E-12
 37    0.127776781778835E-12    -0.131376127744436E-12
 38   -0.621733446036719E-13     0.638767239078820E-13
 39    0.302739840197069E-13    -0.310812902602324E-13
 40   -0.147513520096024E-13     0.151345040098518E-13
 41    0.719243655405693E-14    -0.737452355542546E-14
 42   -0.350904046930157E-14     0.359568344385223E-14
 43    0.171299459984542E-14    -0.175427157815494E-14
 44   -0.836694563539963E-15     0.856381494446786E-15
 45    0.408893411120479E-15    -0.418293259651984E-15
 46   -0.199928685770698E-15     0.204421465226220E-15
 47    0.978030155285417E-16    -0.999525323533448E-16
 48   -0.478665845012651E-16     0.488959734152708E-16
 49    0.234372789238237E-16    -0.239306953222199E-16
 50   -0.114807037377268E-16     0.117174192787109E-16


Table I.  Coefficients for Asymptotic Series
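These series can be summed by convergent factorial series using an algorithm described by Wasow:³

$$\frac{1}{x^{p}} = \sum_{r=p-1}^{\infty}\frac{\Gamma^{r}_{r-p+1}}{x(x+1)(x+2)\cdots(x+r)},$$

where $\Gamma$ denotes the Stirling numbers of the first kind.

Now, $K_0(x) = \left(\dfrac{\pi}{2x}\right)^{1/2} e^{-x}\, S_0$,

where $S_0 = 1 - \dfrac{1}{1!\,(8x)} + T_0$ and

$$T_0 = \sum_{j=2}^{k}\frac{A_j}{x^j} = \frac{A_2}{x^2} + \frac{A_3}{x^3} + \cdots .$$

Applying Wasow's algorithm to these terms,

$$\frac{A_2}{x^2} = A_2\left[\frac{\Gamma^{1}_{0}}{x(x+1)} + \frac{\Gamma^{2}_{1}}{x(x+1)(x+2)} + \frac{\Gamma^{3}_{2}}{x(x+1)(x+2)(x+3)} + \cdots\right],$$

$$\frac{A_3}{x^3} = A_3\left[\frac{\Gamma^{2}_{0}}{x(x+1)(x+2)} + \frac{\Gamma^{3}_{1}}{x(x+1)(x+2)(x+3)} + \cdots\right].$$

Therefore, $T_0$ can be expressed as

$$T_0 = \sum_{r=1}^{\infty}\frac{V_{0,r}}{x(x+1)\cdots(x+r)},$$

where $V_{0,r} = A_2\Gamma^{r}_{r-1} + A_3\Gamma^{r}_{r-2} + A_4\Gamma^{r}_{r-3} + \cdots$.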

These  coefficients  can  be  calculated  and  stored  in  the  memory  of  the 
computer  for  recall  on  demand. 
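
The conversion of an inverse power into a factorial series can be checked numerically. The following is a minimal sketch, not the authors' subroutine: it assumes the standard identity $1/x^{p} = \sum_{r \ge p-1} c(r, p-1) / [x(x+1)\cdots(x+r)]$, where $c(r, k)$ are the unsigned Stirling numbers of the first kind generated by $c(r+1, k) = r\,c(r, k) + c(r, k-1)$. The function name and the use of exact rational arithmetic are illustrative choices.

```python
from fractions import Fraction


def inverse_power_by_factorial_series(x, p, r_max):
    """Approximate 1/x**p by a factorial series truncated at r = r_max.

    Uses unsigned Stirling numbers of the first kind c(r, p-1), built row
    by row with Python integers so that no overflow or rounding occurs
    until the final conversion to float.
    """
    x = Fraction(x)
    row = [1]                # row r of the Stirling triangle; starts with c(0, 0) = 1
    denom = x                # running product x (x+1) ... (x+r)
    total = Fraction(0)
    for r in range(r_max + 1):
        if p - 1 < len(row):             # true once r >= p - 1
            total += Fraction(row[p - 1]) / denom
        nxt = [0] * (len(row) + 1)       # build row r + 1
        for k, c in enumerate(row):
            nxt[k] += r * c
            nxt[k + 1] += c
        row = nxt
        denom *= x + r + 1
    return float(total)


if __name__ == "__main__":
    print(inverse_power_by_factorial_series(10, 2, 150), 1 / 10 ** 2)
    print(inverse_power_by_factorial_series(10, 3, 150), 1 / 10 ** 3)
```

The integers in `row` grow roughly like $(r-1)!$, which is the growth that makes scaling necessary on fixed-range floating-point hardware.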

The  calculations  for  these  coefficients,  involving  Stirling  numbers, 
lead  to  very  large  numbers  in  the  computation  of  high-order  terms. 

Since  the  Stirling  numbers  are  always  greater  than  or  equal  to  one, 
we  modified  them  for  optimal  use  of  the  full  range  of  the  computer. 


³ Asymptotic Expansions for Ordinary Differential Equations, W. Wasow, Interscience Publishers, John Wiley, NY, 1965, p. 330.
The Stirling numbers were modified in the following way:

125 

sj  =  F,  F  =  scale  factor,  such  as  10 

S"  =  sJ‘V(n-l) 

The scale factor and the number of modified Stirling numbers which can be calculated are machine-dependent. The computers at BRL have a single-precision exponent range which is larger than the range of most computers. As can be seen from Table II, for n = 150 the modified Stirling numbers range from about $10^{-135}$ to about $10^{127}$. The process of scaling the Stirling numbers in this way must then be reversed in calculating each term of the factorial series.

By this transformation, we obtained accurate results (15 significant digits) for x > 6 by summing 150 terms. Similar accuracy could be obtained on most computers using double precision.

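Similarly, $K_1(x) = \left(\dfrac{\pi}{2x}\right)^{1/2} e^{-x}\, S_1$, where

$$S_1 = 1 + \frac{1\cdot 3}{1!\,(8x)} + \sum_{r=1}^{\infty}\frac{V_{1,r}}{x(x+1)\cdots(x+r)}$$

and $V_{1,r} = B_2\Gamma^{r}_{r-1} + B_3\Gamma^{r}_{r-2} + \cdots$.

The results for $K_1(x)$ were equally accurate.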

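The asymptotic series for the ordinary Bessel functions, x ≥ 25, are⁴

$$J_0(x) \sim \left(\frac{2}{\pi x}\right)^{1/2}\left[P_0(x)\cos\left(x - \frac{\pi}{4}\right) - Q_0(x)\sin\left(x - \frac{\pi}{4}\right)\right]$$

$$J_1(x) \sim \left(\frac{2}{\pi x}\right)^{1/2}\left[P_1(x)\cos\left(x - \frac{3\pi}{4}\right) - Q_1(x)\sin\left(x - \frac{3\pi}{4}\right)\right]$$


⁴ Bessel Functions, Part I, published by British Association for the Advancement of Science, University Press, Cambridge, England, 1957, p. 202.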
   J        S(J)                     S(J+1)                   S(J+2)

   1   0.2625^1431038901-135   0.293390049185972-131   0.162469629570885-127
   4   0.594416307877133-124   0.161631807256184-120   0.348403288776085-117
   7   0.620095188981095-114   0.937263899085794-111   0.122805989253782-107
  10   0.141690995161720-104   0.145745824899596-101   0.1349946815H499E-98
  13   0.113521536148685E-95   0.872724632875019E-93   0.616964862448953E-90
  16   0.403105100240183E-87   0.244485921578491E-84   0.138176751796042E-81
  19   0.730187886089758E-79   0.361878183982837E-76   0.168651283284991E-73
  22   0.740917060108629E-71   0.307506122625604E-68   0.120810725011220E-65
  25   0.450102130711339E-63   0.159290231850353E-60   0.536288624778063E-58
  28   0.172006626264944E-55   0.526244225186191E-53   0.153759178190215E-50
  31   0.429520654391673E-48   0.114831115329136E-45   0.294089524864916E-43
  34   0.722149952063180E-41   0.170161172544539E-38   0.385045773882694E-36
  37   0.837326275585929E-34   0.175105193623330E-31   0.352369350644976E-29
  40   0.682727806061237E-27   0.127434483546970E-24   0.229266634594032E-22
  43   0.397758943581573E-20   0.665766457973381E-18   0.107555387883677E-15
  46   0.167774015729906E-13   0.252791612712235E-11   0.368043389831136E-09
  49   0.517937210655300E-07   0.704743584443506E-05   0.927441656806729E-03
  52   0.11807577U65955E 00    0.145466542291410E 02   0.173458453446450E 04
  55   0.200240793190783E 06   0.223832004301461E 08   0.242317917476252E 10
  58   0.254107442106383E 12   0.258158475955319E 14   0.254129580523208E 16
  61   0.242426888397106E 18   0.224137435817219E 20   0.200863910466421E 22
  64   0.174495356761975E 24   0.146958814419687E 26   0.119996196936958E 28
  67   0.950002380342511E 29   0.729268853726516E 31   0.542840153293270E 33
  70   0.391821615456175E 35   0.274247724384982E 37   0.186138969273117E 39
  73   0.122508995716187E 41   0.781856783017240E 42   0.483840847139701E 44
  76   0.290319945584933E 46   0.16e899572458664E 48   0.952645306311109E 49
  79   0.520899483162194E 51   0.276096523091475E 53   0.141844276858788E 55
  82   0.706254632117985E 56   0.340768370204050E 58   0.159312784381071E 60
  85   0.721564602846948E 61   0.316568545939851E 63   0.134510966747108E 65
  88   0.553438118324812E 66   0.220455227036434E 68   0.850010949437138E 69
  91   0.317167294679423E 71   0.114501970441310E 73   0.399846121274463E 74
  94   0.135325685209439E 76   0.440823733512481E 77   0.139095522997561E 79
  97   0.424060880287624E 80   0.124873374341855E 82   0.355050280369826E 83
 100   0.974387656698037E 84   0.258006379428166E 86   0.658888038133901E 87
 103   0.162215177246175E 89   0.384836375802625E 90   0.879348340649532E 91
 106   0.193432814981343E 93   0.40940765901B037E 94   0.833293416906864E 95
 109   0.163005483906049E 97   0.306267211155246E 98   0.552343491325650E 99
 112   0.955495084811848+100   0.158431163016062+102   0.251599210263367+103
 115   0.382364608112042+104   0.555606274242301+105   0.771214486559342+106
 118   0.102158281160150+108   0.129004865734295+109   0.155126469001113+110
 121   0.177415994653956+111   0.192739054938949+112   0.198618366362429+113
 124   0.193865171243942+114   0.178944908244716+115   0.155930607458550+116
 127   0.128034645087299+117   0.988621557189148+117   0.716280594664574+118
 130   0.485782753348748+119   0.307581787025027+120   0.181290891228498+121
 133   0.991494859813093+121   0.501359095737511+122   0.233459123119021+123
 136   0.996588263274595+123   0.388006214031004+124   0.136972391170442+125
 139   0.435467363384560+125   0.123698885459199+126   0.311019186727229+126
 142   0.684403248787668+126   0.129991588080456+127   0.209419799774947+127
 145   0.279759712120229+127   0.300552651141317+127   0.248534593164330+127
 148   0.147742753080902+127   0.558451392197723+126   0.100000000000000+126


Table II.  Modified Stirling Numbers for n = 150
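$$Y_0(x) \sim \left(\frac{2}{\pi x}\right)^{1/2}\left[P_0(x)\sin\left(x - \frac{\pi}{4}\right) + Q_0(x)\cos\left(x - \frac{\pi}{4}\right)\right]$$

$$Y_1(x) \sim \left(\frac{2}{\pi x}\right)^{1/2}\left[P_1(x)\sin\left(x - \frac{3\pi}{4}\right) + Q_1(x)\cos\left(x - \frac{3\pi}{4}\right)\right]$$

where

$$P_0(x) \sim 1 - \frac{1^2\cdot 3^2}{2!\,(8x)^2} + \frac{1^2\cdot 3^2\cdot 5^2\cdot 7^2}{4!\,(8x)^4} - \cdots = \sum_{j=0}^{k}\frac{C_j}{x^{2j}}$$

and

$$Q_0(x) \sim -\frac{1^2}{1!\,(8x)} + \frac{1^2\cdot 3^2\cdot 5^2}{3!\,(8x)^3} - \frac{1^2\cdot 3^2\cdot 5^2\cdot 7^2\cdot 9^2}{5!\,(8x)^5} + \cdots = \sum_{j=0}^{k}\frac{D_j}{x^{2j+1}}.$$

Note that $C_0 = |A_0|$, $C_1 = -|A_2|$, ..., $C_j = (-1)^j |A_{2j}|$

and $D_0 = -|A_1|$, $D_1 = |A_3|$, ..., $D_j = (-1)^{j+1} |A_{2j+1}|$.

Similarly,

$$P_1(x) \sim \sum_{j=0}^{k}\frac{E_j}{x^{2j}} \quad\text{and}\quad Q_1(x) \sim \sum_{j=0}^{k}\frac{F_j}{x^{2j+1}}.$$

And, again, $E_0 = |B_0|$, $E_1 = |B_2|$, $E_2 = -|B_4|$, ..., $E_j = (-1)^{j+1} |B_{2j}|$ for $j \ge 1$,

$F_0 = |B_1|$, $F_1 = -|B_3|$, ..., $F_j = (-1)^{j} |B_{2j+1}|$.

For the ordinary Bessel functions, x > 25,

$$J_0(x) = G(x)\sin(x) + H(x)\cos(x)$$

$$J_1(x) = M(x)\sin(x) - N(x)\cos(x)$$

$$Y_0(x) = H(x)\sin(x) - G(x)\cos(x)$$

$$Y_1(x) = -N(x)\sin(x) - M(x)\cos(x)$$

where

$$G(x) = (\pi x)^{-1/2}\,[P_0(x) - Q_0(x)]$$

$$H(x) = (\pi x)^{-1/2}\,[P_0(x) + Q_0(x)]$$

$$M(x) = (\pi x)^{-1/2}\,[P_1(x) + Q_1(x)]$$

$$N(x) = (\pi x)^{-1/2}\,[P_1(x) - Q_1(x)]$$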
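
As an illustration of the rearranged form, the sketch below evaluates $J_0$, $J_1$, $Y_0$ and $Y_1$ from truncated asymptotic series for $P_n$ and $Q_n$. It is not the authors' factorial-series program: it sums the asymptotic series directly, the function names and the truncation at 40 coefficients are assumptions, and it is only meaningful for large arguments such as x = 25. A natural consistency check is the Wronskian $J_1 Y_0 - J_0 Y_1 = 2/(\pi x)$.

```python
import math


def hankel_p_q(n, x, pairs=20):
    """Truncated asymptotic series P_n(x), Q_n(x) for order n = 0 or 1.

    a_k / x**k is built from a_k = a_{k-1} (4 n^2 - (2k-1)^2) / (8 k);
    even k feed P and odd k feed Q, each with sign (-1)**(k // 2).
    """
    mu = 4 * n * n
    term = 1.0               # a_k / x**k, starting with k = 0
    p = 0.0
    q = 0.0
    for k in range(2 * pairs):
        if k > 0:
            term *= (mu - (2 * k - 1) ** 2) / (8.0 * k * x)
        sign = -1.0 if (k // 2) % 2 else 1.0
        if k % 2 == 0:
            p += sign * term
        else:
            q += sign * term
    return p, q


def bessel_large_argument(x):
    """Return (J0, J1, Y0, Y1) from the G, H, M, N arrangement."""
    p0, q0 = hankel_p_q(0, x)
    p1, q1 = hankel_p_q(1, x)
    scale = 1.0 / math.sqrt(math.pi * x)
    g = scale * (p0 - q0)
    h = scale * (p0 + q0)
    m = scale * (p1 + q1)
    n = scale * (p1 - q1)
    s, c = math.sin(x), math.cos(x)
    return g * s + h * c, m * s - n * c, h * s - g * c, -n * s - m * c


if __name__ == "__main__":
    j0, j1, y0, y1 = bessel_large_argument(25.0)
    print(j0, j1, y0, y1)
    print(j1 * y0 - j0 * y1, 2.0 / (math.pi * 25.0))
```

Only one sine and one cosine evaluation are needed per argument in this arrangement, which is presumably the practical motivation for it.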


These series are merely arranged in a different manner.

The results obtained were accurate to 15 significant digits for x ≥ 6 by summing 150 terms. A sample tabulation of the ordinary Bessel functions from the computer is shown in Table III.

3.  HADAMARD SERIES.  The factorial series for calculating $I_n(x)$ diverges, so Hadamard's series was used instead.

$I_n(x)$ can be expressed by:⁵

$$I_n(x) = \frac{(x/2)^{n}}{\Gamma(n+1/2)\,\Gamma(1/2)}\int_0^{\pi} e^{x\cos\theta}\,\sin^{2n}\theta\; d\theta .$$

By Hadamard's method, the series can be expressed as

$$I_n(x) = \frac{e^{x}\,(2x)^{-1/2}}{\Gamma(n+1/2)\,\Gamma(1/2)}\sum_{m=0}^{\infty}\frac{(1/2-n)_m\;\gamma(n+m+1/2,\ 2x)}{m!\,(2x)^{m}},$$

where $\gamma$ denotes the incomplete gamma function and $(1/2-n)_m$ denotes Pochhammer's symbol.


⁵ Theory of Bessel Functions, 2nd Ed., G. N. Watson, Macmillan Co., N.Y., 1948, p. 204.

⁶ Handbook of Mathematical Functions, NBS 55, U.S. Government Printing Office, 1964, pp. 262, 504.
$$(a)_n = a(a+1)(a+2)\cdots(a+n-1), \qquad (a)_0 = 1,$$

and⁶

$$\gamma(a, x) = a^{-1}\,x^{a}\,e^{-x}\,M(1,\ 1+a,\ x),$$

where M denotes the Kummer function.

Hence, after substituting and simplifying, we have

$$I_n(x) = \frac{e^{-x}\,(2x)^{n}}{\Gamma(n+1/2)\,\Gamma(1/2)}\sum_{m=0}^{\infty}\frac{(1/2-n)_m\; M(1,\ n+m+3/2,\ 2x)}{(n+m+1/2)\; m!} .$$
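
The Kummer-function form of the Hadamard series can be tried out with a few lines of code. This is a sketch under stated assumptions rather than the authors' FORTRAN routine: $M(1, b, z)$ is summed directly from its power series $\sum_k z^k/(b)_k$, the series over m is simply truncated after a fixed number of terms (80 here, an arbitrary choice), and the ascending power series for $I_n(x)$ is included only as an independent reference. All function names are hypothetical.

```python
import math


def kummer_m1(b, z):
    """M(1, b, z) = sum over k >= 0 of z**k / (b)_k, for z >= 0 and b > 0."""
    term = 1.0
    total = 1.0
    k = 0
    while True:
        term *= z / (b + k)
        total += term
        k += 1
        if term < 1e-17 * total:
            return total


def bessel_i_hadamard(n, x, terms=80):
    """I_n(x) from the Hadamard series written with Kummer functions."""
    prefactor = math.exp(-x) * (2.0 * x) ** n / (math.gamma(n + 0.5) * math.gamma(0.5))
    pochhammer = 1.0         # (1/2 - n)_m, starting at m = 0
    factorial = 1.0          # m!
    total = 0.0
    for m in range(terms):
        total += pochhammer * kummer_m1(n + m + 1.5, 2.0 * x) / ((n + m + 0.5) * factorial)
        pochhammer *= 0.5 - n + m
        factorial *= m + 1
    return prefactor * total


def bessel_i_power_series(n, x):
    """Reference value: ascending series sum of (x/2)**(2k+n) / (k! (k+n)!)."""
    term = (x / 2.0) ** n / math.factorial(n)
    total = term
    k = 0
    while term > 1e-17 * total:
        k += 1
        term *= (x / 2.0) ** 2 / (k * (k + n))
        total += term
    return total


if __name__ == "__main__":
    for order in (0, 1):
        print(order, bessel_i_hadamard(order, 20.0), bessel_i_power_series(order, 20.0))
```

The late terms of the series decay only algebraically in m, but relative to the sum they are suppressed by a factor of order $e^{-2x}$, so a fixed truncation is adequate only when x is reasonably large.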


?he*reqS™fac“raJr  "hfSl  r"’'  w's/t^get 

have  the  corrert  con^r  JJt  t» 

shown  in  Table  IV.  ^  computer  tabulation  is 

we  had''LJS!“'we"Stained  ?rsigMficmr5igits°f„rx'>l7 


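$$I_0(x) \sim \frac{e^{x}}{(2\pi x)^{1/2}}\left[1 + \frac{1^2}{1!\,(8x)} + \frac{1^2\cdot 3^2}{2!\,(8x)^2} + \frac{1^2\cdot 3^2\cdot 5^2}{3!\,(8x)^3} + \cdots\right]$$

$$I_1(x) \sim \frac{e^{x}}{(2\pi x)^{1/2}}\left[1 - \frac{1\cdot 3}{1!\,(8x)} - \frac{1^2\cdot 3\cdot 5}{2!\,(8x)^2} - \frac{1^2\cdot 3^2\cdot 5\cdot 7}{3!\,(8x)^3} - \cdots\right]$$
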

Besides the immediate value in verifying the accuracy of the Bessel function subroutines, these series are significant for the solution of ordinary differential equations.


Reference  4,  p.  271. 
INTRODUCTION

It is well known that, given sufficient spectral data, the entries of a continued fraction expansion relate intimately to the density function of the inverse Sturm-Liouville problem.¹,² The investigation of the pole-zero distribution of a continued fraction with each of its entries a different complex function is significant because of its simple implementation analytically and numerically. However, although such work can be traced back to the 19th century, the literature shows very little of this kind of study. A recent paper by Lee and Brown sheds some light on the pole-zero distribution pattern of the immittance function of finite inhomogeneous ladder networks by using the chain matrix parameter method.

In this paper, continued fractions with complex function entries are first studied in a general setting. The pole-zero distribution region is described by a conventional root locus equation and is found to be bounded in the corresponding complex plane. The applications of the theorems are illustrated by examples.

PRELIMINARY DEFINITIONS

Let $R^{+}$ denote the positive real line, $C$ the whole complex plane and P.R. any positive real rational function. Two polynomials are said to be relatively prime polynomials, or simply r.p., if they do not have any common factor.


A.  The set of arcs in $C$ satisfying the root locus equation

$$1 + \frac{k}{G(s)} = 0, \qquad k \in R^{+} \cup \{0\},$$

is denoted by $[G(s), k]$. Therefore, $[G(s), k]$ starts from the set of zeros of $G(s)$ at $k = 0$ and ends at the set of poles of $G(s)$ at $k = \infty$.

B.  Suppose

$$P(s) = \prod_{i=1}^{N}\,[G(s) + p_i] \quad\text{and}\quad Q(s) = \prod_{i=1}^{N\ (\text{or}\ N-1)}[G(s) + q_i].$$

If $0 < p_i < q_i < p_{i+1}$, $\forall i \le N - 1$, then the zeros of $P(s)$ and $Q(s)$ are said to alternate with respect to $[G(s), k]$. The zeros of $P(\omega)$ and $Q(\omega)$ alternate on the negative $\omega = G(s)$ axis of the $\omega$-plane, and thus the zeros of $P(s)$ and $Q(s)$ alternate along each locus of

$$1 + \frac{k}{G(s)} = 0, \qquad k \in R^{+} \cup \{0\},$$

in the s-plane.


C.  We shall denote the following continued fraction expansion, or C.F., by $F_N[f_i z(s), g_i y(s)]$ if

$$F_N[f_i z(s), g_i y(s)] = Z_N + \cfrac{1}{Y_N + \cfrac{1}{Z_{N-1} + \cfrac{1}{Y_{N-1} + \cdots + \cfrac{1}{Z_1 + \cfrac{1}{Y_1}}}}} \qquad (1)$$

where $Z_i = f_i z(s)$, $Y_i = g_i y(s)$, $\forall i = 1, 2, \ldots, N$, are the entries of the C.F. and $z(s)$, $y(s)$ are two different complex functions of $s$.

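POLE-ZERO DISTRIBUTION OF FINITE CONTINUED FRACTION OF ARBITRARY COMPLEX FUNCTION ENTRIES

Consider the C.F. $F_N[f_i z(s), g_i y(s)] = A_N/C_N$; then multiplying both sides by $y(s)$ yields

$$\frac{A_N}{y(s)^{-1} C_N} = f_N\,\omega + \cfrac{1}{g_N + \cfrac{1}{f_{N-1}\,\omega + \cdots + \cfrac{1}{f_1\,\omega + \cfrac{1}{g_1}}}} \qquad (2)$$

where $\omega = z(s)y(s)$. Therefore, $A_N$ and $y^{-1}C_N$ are functions of $\omega$.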

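Lemma 1: In the C.F. $F_N[f_i z(s), g_i y(s)] = A_N/C_N$, if $f_i, g_i \in R^{+}$, $\forall i = 1, 2, \ldots, N$, then

$a_1$: the zeros of $A_N(\omega)$ and $y^{-1}C_N(\omega)$ interlace on the negative real axis of the $\omega$-plane with $0 < \alpha_i < \gamma_i < \alpha_{i+1}$ for $i = 1, 2, \ldots, N - 1$, where $-\alpha_i$ and $-\gamma_i$ are the zeros of $A_N(\omega)$ and $y^{-1}C_N(\omega)$ respectively,

$b_1$: $A_N(\omega)\big|_{\omega=0} = 1$, $\quad y^{-1}C_N(\omega)\big|_{\omega=0} = \sum_{i=1}^{N} g_i$.

Proof: By an elementary property of two-element-kind R-L ladder networks, $a_1$ follows immediately from the expression (2). To show $b_1$, mathematical induction is used. Suppose the expression holds for the case N = n; then

$$\frac{A_{n+1}(\omega)}{y^{-1}C_{n+1}(\omega)} = f_{n+1}\,\omega + \cfrac{1}{g_{n+1} + \cfrac{1}{A_n(\omega)/y^{-1}C_n(\omega)}},$$

which yields

$$\frac{A_{n+1}(\omega)}{y^{-1}C_{n+1}(\omega)} = f_{n+1}\,\omega + \frac{A_n(\omega)}{g_{n+1}A_n(\omega) + y^{-1}C_n(\omega)} .$$

Hence,

$$A_{n+1}(\omega)\big|_{\omega=0} = 1, \qquad y^{-1}C_{n+1}(\omega)\big|_{\omega=0} = g_{n+1} + \sum_{i=1}^{n} g_i = \sum_{i=1}^{n+1} g_i .$$

This completes the proof of the lemma.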
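
The induction step of the proof amounts to a two-term recurrence in $\omega$: writing $D_n = y^{-1}C_n$, one has $D_n = g_n A_{n-1} + D_{n-1}$ and $A_n = f_n\,\omega\,D_n + A_{n-1}$, starting from $A_0 = 1$, $D_0 = 0$. The sketch below builds both polynomials with exact fractions; the function names are hypothetical, and the sample entries are taken from Example 2 of this paper, listed from the innermost entry $(f_1, g_1)$ outward.

```python
from fractions import Fraction


def poly_add(p, q):
    """Add two polynomials given as ascending coefficient lists."""
    length = max(len(p), len(q))
    p = p + [Fraction(0)] * (length - len(p))
    q = q + [Fraction(0)] * (length - len(q))
    return [a + b for a, b in zip(p, q)]


def ladder_polynomials(f, g):
    """Return (A_N, D_N) in powers of omega, with D_N standing for y^{-1} C_N.

    f[0], g[0] are the innermost entries f_1, g_1; f[-1], g[-1] are f_N, g_N.
    """
    a = [Fraction(1)]        # A_0 = 1
    d = [Fraction(0)]        # D_0 = 0
    for f_n, g_n in zip(f, g):
        d = poly_add([g_n * c for c in a], d)                   # D_n = g_n A_{n-1} + D_{n-1}
        a = poly_add([Fraction(0)] + [f_n * c for c in d], a)   # A_n = f_n w D_n + A_{n-1}
    return a, d


if __name__ == "__main__":
    f = [Fraction(24, 15), Fraction(6, 15), Fraction(2, 15)]
    g = [Fraction(1, 4), Fraction(5, 4), Fraction(5, 2)]
    a3, d3 = ladder_polynomials(f, g)
    print(a3)   # constant term is A_N(0)
    print(d3)   # constant term is the sum of the g_i
```

With these entries the constant terms reproduce statement $b_1$, and the polynomials factor as $(\omega+1)(\omega+3)(\omega+5)/15$ and $(\omega+2)(\omega+4)/2$, whose zeros interlace on the negative real axis as in $a_1$.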


The following corollary is the direct consequence of the above lemma.

Corollary 1: If the same hypothesis of the foregoing lemma holds for the C.F. $F_N[f_i z(s), g_i y(s)]$, then

$a_2$: the zeros of $A_N(s)$ and $y(s)^{-1}C_N(s)$ alternate with respect to $[z(s)y(s), k]$,

$b_2$: $A_N(s)\big|_{\{s \,|\, z(s)y(s) = 0\}} = 1$, $\quad y(s)^{-1}C_N(s)\big|_{\{s \,|\, z(s)y(s) = 0\}} = \sum_{i=1}^{N} g_i$.

The following facts are observed.

Consider $F_N[f_i z(s), g_i y(s)] = A_N(s)/C_N(s)$; then

$a_3$:

$$A_N(s) = \left(\prod_{i=1}^{N}\alpha_i\right)^{-1}\prod_{i=1}^{N}\,[z(s)y(s) + \alpha_i], \qquad (3)$$

$$y(s)^{-1}C_N(s) = \left(\prod_{i=1}^{N-1}\gamma_i\right)^{-1}\left(\sum_{i=1}^{N} g_i\right)\prod_{i=1}^{N-1}\,[z(s)y(s) + \gamma_i]. \qquad (4)$$


b  :  if  zU)  -  n  (s)/d  (s)  and  y(s)  -  n^(a)/djs).  where  n^(s)  and  djs). 
3  a  a 

xi^is)  and  d^{s)  are  r.p.,  then  we  have 

Case  1;  n  >  m,  where  n  =  degree  of  (njs)nj^(s))  ,  m  =  degree  of 
(d,(a)d^(a))  A„(s)  =  (1^  ==,)  '  £ 


and 


Cw(s) 


/N-1  \-l  /  N  \  n(N-l)  .  \ 


where  z  and  z  alternate  with  respect  to  f  nj  s)n^^(s)/d^(s)d^(  s) ,  kj  . 

Case  2:  for  m  >  n,  then  the  above  explicit  forms  remain  the  same  except 
for  the  upper  running  indices  of  the  product  of  the  factors,  using  m 

instead  of  n. 

In what follows the decomposition theorem pertinent to the synthesis of a finite ladder network is established.

Theorem 1: Let $Z(s) = A(s)/C(s)$ be a rational function. Then $Z(s)$ can be expanded into a C.F. $F_N[f_i z(s), g_i y(s)]$ with $f_i, g_i \in R^{+}$, $z(s) = n_a(s)/d_a(s)$ and $y(s) = n_b(s)/d_b(s)$, where $n_a(s)$ and $d_a(s)$, $n_b(s)$ and $d_b(s)$ are r.p., iff $A(s)$ and $C(s)$ satisfy the following conditions,

$a_4$: the zeros of $A(s)$ and $y(s)^{-1}C(s)$ alternate with respect to $[z(s)y(s), k]$,

$b_4$: $A(s)\big|_{\{s \,|\, n_a(s)n_b(s) = 0\}} = 1$, $\quad y(s)^{-1}C(s)\big|_{\{s \,|\, n_a(s)n_b(s) = 0\}} = \sum_{i=1}^{N} g_i > 0$,

$c_4$: for n > m, where n = degree of $(n_a(s)n_b(s))$ and m = degree of $(d_a(s)d_b(s))$, $A(s)$ and $y(s)^{-1}C(s)$ are polynomials of degree nN and n(N-1); for m > n, $A(s)$ and $y(s)^{-1}C(s)$ are polynomials of degree mN.

Proof. The "only if" part: It follows trivially from Lemma 1 and its corollary.

The  if  part:  Let  A(s)  and  c(s)  satisfy  condition  a  through  c  .  It 

4  4* 

follows  from  definition  B, 

A(s)  a.j  [\(s)’^(s)  +a.djs)d^(s)j 


N-l  \-l  /  N 


c(s)  =^_n^  Yij  I  gij%(s)<ajs)  [njs)r^(s)  +  YidJs)d^(s)J  , 


Xvhere 


0  <  a 


.  <  Y-  <  Q:.  >  Vi  =  1  2  '  •  • 

1  1+1’  , 


N-l 


Therefore, 


A(s)/c(s)  = 


N  -1  \  N 


I  /  A 
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which  yields. 


and 

a^,  b-^  >  0,  Vi  . 


Write, 


A(a3)/y  c(a))  = 

^  JU 

]-^  ’ 

vhere 

=  V 

-l^^N  [®N-l'^®N  “ 

b  ^/b  1  >  0  , 
^2'  N  J  ’ 

and 

=  a((u)  - 

^  c(tu) 

„  N-1  . 

=  a*_^oj  +  ••' 

*  +  a*  0)  +  1  , 

(5) 

y"^  C-i!-  =  y"^  c(a))  - 

=  b't  .  +  •  •  " 

N-c: 

+  b*  u)  , 

(6) 

then 

b^:  >  0  . 

X 

Moreover,  the  interlacing  zeros  of  kiw)  and  y  ^C(u))  in  the  negative  real 
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axis  of  cu-plane  implies  that  the  zeros  of  and  y"  ^  C'”' alternate  on  the 
same  negative  real  axis,  by  hypothesis  a^  through  and  Fig.  1  shows  the 

locations  of  the  zeros  of  A-.  Therefore,  the  zeros  of  y“^c^:-  and  A^^ 

interlace  on  the  negative  cr-axis  starting  with  the  first  zero  belonging 

to  A  as  shoxra  in  Fig.  2  following  the  same  argument.  Hence,  and  y"^C^‘ 
can  then  be  wit  ten  as 

N-1 


=  V  n  (»+«) . 


and 


y"  V  =  k 


N-2 


+  "Vi)  >  1  -  1.  2,  •  •  •,  N  -  1,  0  <  a  <  Y.  <0!.  , 

1-1  1  *1  l+l  » 


where 


followed  from 


N-1  \  -1 

A  V  ’ 


I  -  A{®)  -  f  a:y"'c(o))|  =  1 

®  =  0  '„,  =  0 


and 


followed  from 


/N-2  \-l/N-l 

=1  n  Yi'  1  12  g. 


i=i  ^ 


i=l 


y"V!  =  y"^c(cu)  -  g^^-: 

cju-O 


N 


K-1 


«-0  "  1=1  -  ifi 
It is easily seen that $A^{*}$ and $y^{-1}C^{*}$ satisfy conditions $a_4$ through $c_4$ by simply substituting back $\omega = z(s)y(s)$ into $A^{*}$ and $y^{-1}C^{*}$, except that the degrees of $A^{*}(s)$ and $y^{-1}C^{*}(s)$ are n or m less than the corresponding degrees of $A(s)$ and $y(s)^{-1}C(s)$ respectively.

Therefore, this process is continued until N = 1; in this case

$$A_1(\omega)/y^{-1}C_1(\omega) = a_1\,\omega + 1/c_1 ,$$

where $f_1 = a_1 > 0$ and $g_1 = c_1 > 0$.

Q.E.D.

BOUNDEDNESS OF THE MODULUS OF ZEROS AND POLES OF $F_N[f_i z(s), g_i y(s)]$

In what follows the uniform bound is found.

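Lemma 2: In the C.F. $F_N[f_i z(s), g_i y(s)]$, if $f_i, g_i \in R^{+}$, $\forall i$, then

$$\frac{\partial}{\partial\omega}\Big\{y(s)F_N[f_i z(s), g_i y(s)]\Big\}\Big|_{z(s)y(s)=\omega} > 0, \qquad \forall\,\omega \text{ real and } \omega \neq -\gamma_i ,$$

where $-\gamma_i$ are the poles of $y(s)F_N[f_i z(s), g_i y(s)]\big|_{z(s)y(s)=\omega}$ in the $\omega$-plane.

Proof: Straightforward computation shows

$$\frac{\partial}{\partial\omega}\,[yF_N(\omega)] = f_N + \frac{\dfrac{\partial}{\partial\omega}\,[yF_{N-1}(\omega)]}{[\,g_N\, yF_{N-1}(\omega) + 1\,]^{2}}, \qquad \omega \neq -\gamma_i ,$$

where $-\gamma_i$ are the poles of $yF_N(\omega)$. The foregoing relation implies that if $\partial/\partial\omega\,[yF_{N-1}(\omega)] > 0$, then $\partial/\partial\omega\,[yF_N(\omega)] > 0$. But the cases N = 1, 2 are trivially true, and hence the lemma.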

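Lemma 3: In the C.F. $F_N[f_i z(s), g_i y(s)]$, if $f_i, g_i \in R^{+}$, $\forall i$, then

$$\frac{\partial}{\partial f_i}\Big\{y(s)F_N[f_i z(s), g_i y(s)]\Big\}\Big|_{z(s)y(s)=\omega} \le 0$$

and

$$\frac{\partial}{\partial g_i}\Big\{y(s)F_N[f_i z(s), g_i y(s)]\Big\}\Big|_{z(s)y(s)=\omega} \le 0, \qquad \forall\,\omega$$

real and nonpositive.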

Proof:  It  follows  from  (2), 

<0.  Vcs-cR-^, 

[yV“)]  =  -  r - ^-r-  <  0,  Vco  . 

Let 


yF*(a))  =  g.  + - —  — i 


1  1 


f^_l(u+  gj_^+ - +  f^U,+ 


Then,  simple  computation  j'ields,  for  any  k  <  N 


5/3fk  1)^^^'^^ 


(D 


[^^n(  ]  [y^N- 1  ( ]  ^  *  •  •  yFj^(  co)  J 


3/3gl^  _  1 _ _ 

[yF^Cco)]  . .  ['yFjo))  ]  ^  [yF|>-(a))  Y 

(7)

It is obvious that the right-hand sides of the above equations are nonpositive for $\omega$ real and nonpositive. This completes the proof of the lemma.
Theorem 2: Let there be two C.F., $F_N[f_i z(s), g_i y(s)]$ and $F_N^{*}[f_i^{*} z(s), g_i^{*} y(s)]$. If $f_i \ge f_i^{*} \in R^{+}$, $g_i \ge g_i^{*} \in R^{+}$, $\forall i \le N$, then $\alpha_i \le \alpha_i^{*}$, $\forall i \le N$, and $\gamma_j \le \gamma_j^{*}$, $\forall j \le N - 1$, where $-\alpha_i$, $-\alpha_i^{*}$ and $-\gamma_j$, $-\gamma_j^{*}$ are the zeros and the poles of $yF_N(\omega)$ and $yF_N^{*}(\omega)$ respectively.

Proof: By Lemma 2, $yF_N(\omega)$ is a monotonically increasing function of $\omega \neq -\gamma_i$; by Lemma 3, the same function is a nonincreasing function of $f_i$ and $g_i$, $\forall\omega$ real and nonpositive. Therefore all the zeros of $yF_N(\omega)$ shift to the right on the real $\omega$-axis as all the entries $f_i$ and $g_i$ increase in value, as shown in Fig. 3. This gives $\alpha_i \le \alpha_i^{*}$, $\forall i \le N$. The result for the poles of $yF_N(\omega)$ and $yF_N^{*}(\omega)$ follows by applying the same argument to the function $[yF_N(\omega)]^{-1}$ and is omitted here.

Q.E.D.


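It is noted that if the entries of the C.F. $F_N[f_i z(s), g_i y(s)] = A_N/C_N$ are uniform, $f_i = f \in R^{+}$, $g_i = g \in R^{+}$, $\forall i \le N$, then we have

$$A_N(\omega) = \frac{\sinh\,(N+1)a(\omega) - \sinh Na(\omega)}{\sinh a(\omega)} \qquad (8)$$

$$y^{-1}C_N(\omega) = g\,\frac{\sinh Na(\omega)}{\sinh a(\omega)} , \qquad (9)$$

where

$$\cosh a(\omega) = 1 + fg\,\omega/2 .$$

Lemma 4: Let the entries of the C.F. $F_N[f_i z(s), g_i y(s)] = A_N/C_N$ be uniform as defined above; then

$$\alpha_k = \frac{2}{fg}\left\{1 - \cos\left[\frac{(2k-1)\pi}{2N+1}\right]\right\}, \qquad \forall k \le N, \qquad (10)$$

$$\gamma_k = \frac{2}{fg}\left\{1 - \cos\left(\frac{k\pi}{N}\right)\right\}, \qquad \forall k \le N - 1. \qquad (11)$$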


Proof: Substituting the following identities into (8) and (9),

$$\cosh Na = 2^{N-1}\prod_{k=1}^{N}\left[\cosh a - \cos\frac{(2k-1)\pi}{2N}\right],$$

$$\sinh Na = 2^{N-1}\sinh a\prod_{k=1}^{N-1}\left[\cosh a - \cos\frac{k\pi}{N}\right],$$

results in the two statements of the lemma, by using the same argument as in Theorem 1 concerning the sum of two polynomials with interlacing zeros on the real axis.
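
The closed-form zeros and poles for uniform entries can be checked against the ladder recurrence directly. The sketch below assumes the recurrence $D_n = g A_{n-1} + D_{n-1}$, $A_n = f\,\omega\,D_n + A_{n-1}$ (with $D_n = y^{-1}C_n$, $A_0 = 1$, $D_0 = 0$) and the relation $\cosh a = 1 + fg\,\omega/2$; the function names and sample values are illustrative.

```python
import math


def uniform_ladder_values(f, g, n, omega):
    """Evaluate A_n(omega) and D_n(omega) = y^{-1} C_n(omega) for uniform entries."""
    a, d = 1.0, 0.0
    for _ in range(n):
        d = g * a + d
        a = f * omega * d + a
    return a, d


def uniform_zero_moduli(f, g, n):
    """alpha_k, k = 1..n: A_n vanishes at omega = -alpha_k."""
    return [2.0 * (1.0 - math.cos((2 * k - 1) * math.pi / (2 * n + 1))) / (f * g)
            for k in range(1, n + 1)]


def uniform_pole_moduli(f, g, n):
    """gamma_k, k = 1..n-1: D_n vanishes at omega = -gamma_k."""
    return [2.0 * (1.0 - math.cos(k * math.pi / n)) / (f * g)
            for k in range(1, n)]


if __name__ == "__main__":
    f, g, n = 0.7, 1.3, 5
    for alpha in uniform_zero_moduli(f, g, n):
        print("zero", -alpha, uniform_ladder_values(f, g, n, -alpha)[0])
    for gamma in uniform_pole_moduli(f, g, n):
        print("pole", -gamma, uniform_ladder_values(f, g, n, -gamma)[1])
```

Merging the two lists in increasing order also exhibits the interlacing $\alpha_1 < \gamma_1 < \alpha_2 < \cdots$ required by Lemma 1.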


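As a consequence, the following theorem is established.

Theorem 3: Let $-\alpha_k$, $-\gamma_k$ be the zeros and the poles of the C.F. $y(s)F_N[f_i z(s), g_i y(s)]\big|_{z(s)y(s)=\omega}$, with $f_i, g_i \in R^{+}$, $\forall i$; then

$$0 < \frac{2}{\bar f\,\bar g}\left\{1 - \cos\left[\frac{(2k-1)\pi}{2N+1}\right]\right\} \le \alpha_k \le \frac{2}{\underline f\,\underline g}\left\{1 - \cos\left[\frac{(2k-1)\pi}{2N+1}\right]\right\}, \qquad \forall k \le N,$$

$$0 < \frac{2}{\bar f\,\bar g}\left\{1 - \cos\left(\frac{k\pi}{N}\right)\right\} \le \gamma_k \le \frac{2}{\underline f\,\underline g}\left\{1 - \cos\left(\frac{k\pi}{N}\right)\right\}, \qquad \forall k \le N - 1,$$

where

$$\underline f = \inf_{i \le N} f_i, \quad \underline g = \inf_{i \le N} g_i, \quad \bar f = \sup_{i \le N} f_i \quad\text{and}\quad \bar g = \sup_{i \le N} g_i .$$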
Proof: The above result follows immediately from Theorem 2 and Lemma 4, since the zeros and the poles of the C.F. $y(s)F_N[f_i z(s), g_i y(s)]\big|_{z(s)y(s)=\omega}$ shift to the right on the negative real $\omega$-axis with the increase in value of all its entries $f_i$, $g_i$, $\forall i$. Therefore, these zeros and poles are bounded in modulus by those of the zeros and the poles of the two corresponding C.F. of uniform entries, with $f = \inf_{i \le N} f_i$, $g = \inf_{i \le N} g_i$ and $f = \sup_{i \le N} f_i$, $g = \sup_{i \le N} g_i$, respectively.

Q.E.D.

ASYMPTOTIC  DISTRIBUTION  OF  THE  POLES  AND  THE  ZEROS  OF  THE  SEQUENCE  OF 
THE  CONTINUED  FRACTIONS 

Let  |Fjj  [^f^z(s),  jjgjy(s)j  j-  be  defined  as  a  sequence  of  C.F.  for 

N  =  1,  Now  for  each  fixed  N,  the  entries  of  the  corresponding 

C.F.  are  N®i  “  N^i''^’  ^  follows  the  re¬ 

sult  pertaining  to  the  integrated  networks  are  derived. 

Theorem  4;  If  j^,a^  and  ^^c^,  for  all  N  and  i,  of  the  above  defined 
sequence  are  bounded  away  from  zero,  then  and  )  for 

sufficient  large  N  and  k,  where  -  zero  and 

pole  of  the  corresponding  C.F.  s) ,  j^g^yCs)]  in  the  cd  =  z(s)y(s) 

plane. 

Proof:  Since  and  ^c^  are  bounded  away  from  zero  for  all  N  and  i, 
we  choose 

a  =  sup  ^a.  for  all  N  and  c  =  sup  c  for  all  N  . 
i<N 
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Hence  a  sequence  of  uni 


Lform  C.F.  [ 


and  -  c/W  has  the  following  relationship  by  Lemma  4, 


cos  ^  a  ■  >=  <N  -  1  , 


where  and  are  the  kth  zero  and  pole  of  the  uniform  C*F.  F^^f^(s), 
in  the  (w-plane. 

It  follows  from  Theorem  3,  we  have. 


(2  k- 


Vn,u<k.. 

N  J  ac 


The  conclusion  of  the  theorem  follows 


Q.E.D. 

REMARK 

Theorem 4 is used to investigate the asymptotic behavior of the zeros and the poles of the nonuniform C.F. in the $\omega$-plane as well as the convergence of $A_N(\omega)$ and $y^{-1}C_N(\omega)$ as $N \to \infty$. It follows in particular
that  if  j^f^  =  N^i»  away  from  zero, 

then  and  =  0(n^ ) ,  as  N  where  ^Yj^,  are  the  zeros  and  the 

poles of the corresponding C.F. This result is consistent with the result obtained from solving the transmission line equations for the distributed networks.
EXAMPLES AND APPLICATIONS


Example 1: Let the entries of the C.F. be $(f_3, f_2, f_1) = (8/15,\ 8/5,\ 32/5)$ and $(g_3, g_2, g_1) = (5/8,\ 5/16,\ 1/16)$; $z(s) = s/(s-1)$ and $y(s) = 1/(s-1)$.

Simple computation yields

$$F_3[f_i z(s), g_i y(s)] = \frac{A_3}{C_3} = \frac{8\,(15s^6 - 67s^5 + 142s^4 - 179s^3 + 142s^2 - 67s + 15)}{15\,(8s^5 - 34s^4 + 63s^3 - 63s^2 + 34s - 8)}$$

and $[z(s)y(s), k] = [s/(s-1)^2, k]$, which satisfies the root locus equation

$$1 + \frac{k\,(s-1)^2}{s} = 0, \qquad k \in \{0 \cup R^{+}\},$$

and is shown in Fig. 4.

It follows that

$$A_3 = (1/15)\,[s - (1 + j\sqrt{3})/2]\,[s - (1 - j\sqrt{3})/2]\,[s - (5 + j\sqrt{11})/6]\,[s - (5 - j\sqrt{11})/6]\,[s - (9 + j\sqrt{19})/10]\,[s - (9 - j\sqrt{19})/10]$$

$$y^{-1}C_3 = (1/8)\,[s - (3 + j\sqrt{7})/4]\,[s - (3 - j\sqrt{7})/4]\,[s - (7 + j\sqrt{15})/8]\,[s - (7 - j\sqrt{15})/8] .$$

The zeros of $A_3$ and $y^{-1}C_3$ are shown in Fig. 5. As can be seen, they alternate with respect to $[s/(s-1)^2, k]$.

In the example given below, Theorem 1 is used to realize a ladder network with a given immittance function.
Example 2: Let the poles and the zeros of a driving point impedance $Z(s)$ be specified at $-2$, $-3/2 \pm j\sqrt{7}/2$, $-3/2 \pm j\sqrt{15}/2$ and at $-3/2 \pm j\sqrt{3}/2$, $-3/2 \pm j\sqrt{11}/2$, $-3/2 \pm j\sqrt{19}/2$, respectively.

Synthesis procedures:

1)  Construct the pole-zero plot for $Z(s)$, as shown in Fig. 6.

2)  Find an arc, as shown in Fig. 7, passing through all these singularities. This arc is described by $[(s+1)(s+2), k]$ by inspection; hence, let $z(s) = s + 1$ and $y(s) = s + 2$.

3)  Multiplying out results in

$$Z(s) = \frac{k_1\,(s^6 + 9s^5 + 42s^4 + 117s^3 + 206s^2 + 213s + 105)}{k_2\,(s^5 + 8s^4 + 31s^3 + 68s^2 + 84s + 48)} .$$

4)  Since $A(s)\big|_{\{s \,|\, (s+1)(s+2) = 0\}} = 1$ yields $k_1 = 1/15$, and $(s+2)^{-1}C(s)\big|_{\{s \,|\, (s+1)(s+2) = 0\}} = 4$ yields $k_2 = 1/2$ (note that the number 4 is arbitrarily assumed, which happens to be the total capacitance of the ladder network), we have

$$(s+2)\,Z(s)\big|_{(s+1)(s+2) = \omega} = \frac{2\,(\omega + 1)(\omega + 3)(\omega + 5)}{15\,(\omega + 2)(\omega + 4)} .$$

5)  Hence the C.F. gives $(f_3, f_2, f_1) = (2/15,\ 6/15,\ 24/15)$ and $(g_3, g_2, g_1) = (15/6,\ 15/12,\ 1/4)$. Fig. 8 shows the corresponding network.
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CONCLUSION  AND  SUMMARY 


The complete pole-zero pattern of a continued fraction of nonuniform
entries is established using arcs in the s-plane defined by a simple root
locus method.

A process of decomposition of rational functions satisfying the
foregoing pole-zero patterns into continued fractions is used to
synthesize general inhomogeneous ladder networks.

The analysis and synthesis results established are being extended
to the case of infinite ladder networks ($N \to \infty$) and the problem of the
transition between lumped and distributed networks.

ACKNOWLEDGEMENT 

The authors wish to thank Dr. David Brown of the University of
Wisconsin for his criticism and Dr. C. E. Carroll of the University of
Pennsylvania for his idea in the proof of Lemma 2.

REFERENCES 

1. Bellman, R., "A Note on an Inverse Problem in Mathematical Physics,"
Quarterly J. Mech. Appl. Math. [volume illegible] (1951).

2. Anderson, L. E., "On the Effective Determination of the Wave
Operator from Given Spectral Data in the Case of a Difference Equation
Corresponding to a Sturm-Liouville Differential Equation," J.
Math. Anal. and Appl. [volume illegible] (1970).

3. Lee and Brown, "Decomposition Theorem," Allerton Proc. on Ckt. and
Syst. Th. [year illegible].

4. Evans, W. R., "Control System Dynamics," McGraw-Hill (1954)
[page range partly illegible, ending at p. 120].


[Figure 4. The root locus of [s/(s - 1)^2, k].]
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AUTOMATIC  NUMERICAL  INTEGRATION  USING 
VP-SPLINES 

Royce  W.  Soanes,  Jr. 

Research  Directorate 
Benet  Weapons  Laboratory 
Watervliet  Arsenal 
Watervliet,  New  York  12189 


ABSTRACT. A method of exploiting VP (variable power) splines
for the purpose of automatic numerical quadrature is presented. The
essence of the adaptive method given here is to select mesh points
near the node where an upper bound on the local area discrepancy
between the trapezoidal estimate and the local VP spline estimate of
the integral is a maximum. A comparison is made with Gaussian
quadrature for an integral containing a parameter.

1. INTRODUCTION. The term "Automatic Integrator" refers to
numerical integration algorithms which adapt themselves to the
particular situation at hand. Automatic integrators are particularly
handy for obtaining dependable integral estimates during computation
on a problem which may involve many integrals and whose nature may
change from time to time as the parameters involved fluctuate. They
are also useful in situations where the integrand may be expensive
(time consuming) to evaluate, as is the case with multidimensional
integrals.

The basic philosophy behind the automatic integration in this
article will be to spend some computational overhead time in monitoring
the region of the integrand where the VP spline interpolator is
making the most significant contribution to the integral estimate
(relative to the linear interpolator) and to evaluate the integrand in
these significant regions.

As increasingly more information is accumulated about the
integrand, it will be possible for the algorithm to gradually abandon
evaluation of the integrand over large regions of uniform behavior
and transfer its attention to regions where the integrand behaves more
abruptly. This process will generally produce a nonuniform mesh and
it will be necessary to have on hand an interpolator which is smooth
but stable. Variable power splines satisfy this requirement since
they are twice differentiable and they may be given some local
derivative control which renders them less likely to inject interpolatory
oscillations.

2. SUMMARY OF BASIC VP SPLINE FORMULAS. The interpolatory
functions used here are the VP (variable power) splines given on the
ith subinterval by Eq. (1).

(1)  $k_i\, y_i(x) = a_i + b_i r_i + c_i r_i^{m_i} + d_i (1 - r_i)^{n_i}$

where

$r_i = (x - x_i)/\ell_i$, and

$\ell_i = x_{i+1} - x_i$.

The normalizing factor is $k_i = 1 - (m_i - 1)(n_i - 1)$, the quantity
computed as KI in the appendix listing.

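The four parameters $a_i$, $b_i$, $c_i$ and $d_i$ may be eliminated in favor
of $y_i$, $y_{i+1}$, $y'_i$ and $y'_{i+1}$.

(2)  $a_i = k_i y_i + \ell_i\,(m_i q_i - (m_i - 1)\,y'_i - y'_{i+1})$

(3)  $b_i = \ell_i\,(-m_i n_i q_i + m_i y'_i + n_i y'_{i+1})$

(4)  $c_i = \ell_i\,(n_i q_i - y'_i - (n_i - 1)\,y'_{i+1})$

(5)  $d_i = \ell_i\,(-m_i q_i + (m_i - 1)\,y'_i + y'_{i+1})$

where $q_i = (y_{i+1} - y_i)/\ell_i$.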

If second derivative continuity is enforced at the interior nodes
and the curvature is set equal to zero at the end points, the following
tridiagonal system of equations may be obtained.

(6)  $(m_1 - 1)\,y'_1 + y'_2 = m_1 q_1$

(7)  $A_i\, y'_{i-1} + B_i\, y'_i + C_i\, y'_{i+1} = D_i$

(8)  $y'_{N-1} + (n_{N-1} - 1)\,y'_N = n_{N-1}\, q_{N-1}$

The coefficients in Eq. (7) are given by equations (9-12).

(9)  $A_i = m_{i-1}(m_{i-1} - 1)/(k_{i-1}\,\ell_{i-1})$

(10)  $C_i = n_i (n_i - 1)/(k_i\,\ell_i)$

(11)  $B_i = (n_{i-1} - 1)\,A_i + (m_i - 1)\,C_i$

(12)  $D_i = n_{i-1} A_i\, q_{i-1} + m_i C_i\, q_i$

The solution to the system described by equations (6-12) yields
the nodal derivatives which ensure continuity of the second derivative
of the interpolator.
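The tridiagonal solve described by equations (6-12) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: indices are 0-based, the function name `vp_nodal_derivatives` is hypothetical, and the factor $k_i = 1 - (m_i - 1)(n_i - 1)$ is taken from the variable KI of the appendix listing.

```python
def vp_nodal_derivatives(x, y, m, n):
    """Nodal derivatives y'_i of a VP spline from Eqs. (6)-(12).

    x, y : nodes and data values (length N, 0-based here)
    m, n : powers m_i, n_i on each of the N-1 subintervals
    """
    N = len(x)
    ell = [x[i + 1] - x[i] for i in range(N - 1)]            # l_i
    q = [(y[i + 1] - y[i]) / ell[i] for i in range(N - 1)]   # q_i
    # k_i as computed in the appendix listing (assumption: KI there)
    k = [1.0 - (m[i] - 1.0) * (n[i] - 1.0) for i in range(N - 1)]

    sub = [0.0] * N   # A_i (sub-diagonal)
    dia = [0.0] * N   # B_i (diagonal)
    sup = [0.0] * N   # C_i (super-diagonal)
    rhs = [0.0] * N   # D_i (right-hand side)

    # Eq. (6): zero curvature at the left end
    dia[0], sup[0], rhs[0] = m[0] - 1.0, 1.0, m[0] * q[0]
    # Eqs. (7), (9)-(12): second-derivative continuity at interior nodes
    for i in range(1, N - 1):
        A = m[i - 1] * (m[i - 1] - 1.0) / (k[i - 1] * ell[i - 1])
        C = n[i] * (n[i] - 1.0) / (k[i] * ell[i])
        sub[i], sup[i] = A, C
        dia[i] = (n[i - 1] - 1.0) * A + (m[i] - 1.0) * C
        rhs[i] = n[i - 1] * A * q[i - 1] + m[i] * C * q[i]
    # Eq. (8): zero curvature at the right end
    sub[N - 1], dia[N - 1] = 1.0, n[N - 2] - 1.0
    rhs[N - 1] = n[N - 2] * q[N - 2]

    # forward elimination below the diagonal
    for i in range(1, N):
        f = sub[i] / dia[i - 1]
        dia[i] -= f * sup[i - 1]
        rhs[i] -= f * rhs[i - 1]
    # back substitution
    d = [0.0] * N
    d[N - 1] = rhs[N - 1] / dia[N - 1]
    for i in range(N - 2, -1, -1):
        d[i] = (rhs[i] - sup[i] * d[i + 1]) / dia[i]
    return d
```

With all powers equal to 3 the basis on each subinterval spans the cubics, so the sketch should reproduce the natural cubic spline slopes; for data lying on a straight line it returns that line's slope at every node for any admissible powers.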

All that is needed now to completely define the interpolator is
the setting of the nonlinear parameter vectors m and n. The values of
$m_{i-1}$ and $n_i$ are set by obtaining a VP spline over the restricted node
set $[x_{i-1}, x_i, x_{i+1}]$. Setting the end curvatures equal to zero and
setting $y'_i$ equal to the slope of the line through $(x_i, y_i)$ which makes
equal angles with the linear interpolator on the left and right of $x_i$
yields Eq. (13).

(13)  $n_i/m_{i-1} = (\ell_i/\ell_{i-1})\,\sqrt{(q_{i-1}^2 + 1)/(q_i^2 + 1)}$

Equation (13) sets the m's and n's while assuming a lower bound
of L on them, i.e., either $n_i = L$ or $m_{i-1} = L$. This lower bound L must
be greater than 2 and it need not be greater than 3. Values of L
greater than 3 tend to produce too much flattening of the interpolator
between nodes.
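The rule for setting the powers can be illustrated with a short Python sketch that follows the logic of the MNSET listing in the appendix: the ratio of Eq. (13) is formed at each interior node, and whichever of $m_{i-1}$, $n_i$ would be smaller is pinned at L. The function name `set_powers`, the 0-based indexing, and the default L = 2.5 (the value used for the error table) are choices made here for illustration.

```python
import math

def set_powers(x, y, L=2.5):
    """Return (m, n), the VP spline powers on each subinterval, per Eq. (13)."""
    N = len(x)
    ell = [x[i + 1] - x[i] for i in range(N - 1)]
    q = [(y[i + 1] - y[i]) / ell[i] for i in range(N - 1)]
    m = [L] * (N - 1)   # m on the last subinterval stays at L
    n = [L] * (N - 1)   # n on the first subinterval stays at L
    for i in range(1, N - 1):          # interior nodes
        # ratio n_i / m_{i-1} from Eq. (13)
        R = (ell[i] / ell[i - 1]) * math.sqrt((1.0 + q[i - 1] ** 2) / (1.0 + q[i] ** 2))
        if R <= 1.0:
            n[i] = L                   # smaller power sits at the lower bound
            m[i - 1] = L / R
        else:
            m[i - 1] = L
            n[i] = L * R
    return m, n
```

On a uniform mesh with equal slope magnitudes on both sides of a node the ratio is 1 and both powers equal L; a subinterval twice as long as its left neighbour (with equal slopes) doubles $n_i$ relative to $m_{i-1}$.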

3. INTEGRATION FORMULA. If the VP spline is integrated over the
ith subinterval, we may obtain Eq. (14) after some rearrangement and
simplification.

(14)  $\int_{x_i}^{x_{i+1}} y_i(x)\,dx = (\ell_i/2)(y_i + y_{i+1}) + \Delta_i$

where, integrating Eq. (1) term by term and using Eqs. (2)-(5),

$\Delta_i = -\dfrac{\ell_i}{2 k_i}\left[\dfrac{(m_i - 1)\,c_i}{m_i + 1} + \dfrac{(n_i - 1)\,d_i}{n_i + 1}\right]$

The quantity $\Delta_i$ is the discrepancy between the trapezoidal
estimate and the VP spline estimate of the integral over the ith
subinterval. This expression for $\Delta_i$ is not dependent on the
existence of second derivatives.

If $q_i$ is between $q_{i-1}$ and $q_{i+1}$ and $y'_k$ is between $q_{k-1}$ and $q_k$
(k = i, i+1) and $m_i = n_i = m$, the maximum value that $|\Delta_i|$ may take on
is $\ell_i^2\,|q_{i+1} - q_{i-1}|/(6 + 4\sqrt{2})$ for an m of $1 + \sqrt{2}$.

4. SIGNIFICANT NODES. An initial mesh over the desired interval
of integration must be assumed. This mesh may be uniform, or
prior analytic knowledge of the integrand may prompt the insertion of
a node or two near an abruptness in the integrand. In any case, the
initial mesh may be uniform or non-uniform and may contain as few as
three points.

The relative significance of the various points in the sample
must be determined first. This will be done by considering the
behavior of a VP spline with zero end curvatures over the restricted
node set $[x_{i-1}, x_i, x_{i+1}]$. Enforcement of second derivative
continuity at node i yields Eq. (15).

(15)  $R_i = n_i/m_{i-1} = (\ell_i/\ell_{i-1})\,(y'_i - q_{i-1})/(q_i - y'_i)$

This equation implies Eq. (13) with $y'_i$ selected as previously
mentioned. If Eq. (14) is used with the conditions for zero end
curvatures, two simple integral formulas may be obtained for the three
point VP spline.

(16)  $\int_{x_{i-1}}^{x_i} y_{i-1}(x)\,dx = (\ell_{i-1}/2)(y_{i-1} + y_i) + u_i$

(17)  $\int_{x_i}^{x_{i+1}} y_i(x)\,dx = (\ell_i/2)(y_i + y_{i+1}) + v_i$

where

$u_i = (\ell_{i-1}^2/2)\,(q_{i-1} - y'_i)/(m_{i-1} + 1)$

and

$v_i = (\ell_i^2/2)\,(y'_i - q_i)/(n_i + 1)$

The two area discrepancy terms $u_i$ and $v_i$ will be used to
determine the significant points of the sample.

At this point, we want to notice the effect of $y'_i$ on $u_i$ and $v_i$
as it varies between the left and right difference quotients $q_{i-1}$ and
$q_i$, which are taken to be the reasonable limits for the assignment of
$y'_i$ locally.

From Eq. (15) we see that as $y'_i$ approaches $q_i$, $R_i$ approaches
infinity. The value L is therefore assigned to $m_{i-1}$ as $n_i$ becomes
infinite. The quantity $u_i$ therefore approaches its extreme value $u_i^*$
as $v_i$ approaches zero.

(18)  $u_i^* = (\ell_{i-1}^2/2)\,(q_{i-1} - q_i)/(L + 1)$

Similarly, as $y'_i$ approaches $q_{i-1}$, $R_i$ approaches zero. Hence, $n_i$ is
assigned the value L as $m_{i-1}$ becomes infinite. We therefore have $v_i$
approaching its extreme value $v_i^*$ while $u_i$ approaches zero.

(19)  $v_i^* = (\ell_i^2/2)\,(q_{i-1} - q_i)/(L + 1)$

These extreme values of $u_i$ and $v_i$ give us the significance
weights that we will assign to the nodes in the sample.

(20)  $w_i = (\ell_{i-1}^2 + \ell_i^2)\,|q_i - q_{i-1}|$

5. INTEGRATION ALGORITHM. The weight given by Eq. (20) is
proportional to the sum of $|u_i^*|$ and $|v_i^*|$. It is an easily calculated
measure of the possible disagreement which may exist between the VP
spline estimate of the integral locally and the linear estimate. It
behooves us, therefore, to examine the integrand more closely near the
node where $w_i$ is presently the largest. An algorithm for automatic
integration may therefore be summarized by the following procedural
outline.

I. Generate an initial (not necessarily uniform) mesh over the
interval of integration, evaluate the integrand and compute
the trapezoidal estimate of the integral.

II. Compute the ith nodal significance weight according to Eq.
(20) for 1 < i < N.

III. Find the node where $w_i$ is the largest.

IV. If the maximum weight is less than a given fraction of the
trapezoidal estimate or if the number of functional
evaluations exceeds a given amount, skip to VII; otherwise
continue to V.

V. Evaluate the integrand at the midpoint of the ith ((i-1)th)
subinterval if $\ell_i$ is larger (smaller) than $\ell_{i-1}$.

VI. Update the x and y arrays and the trapezoidal estimate and
recalculate the three appropriate nodal weights. Return to
step III.

VII. Set the m's and n's according to Eq. (13).

VIII. Compute the nodal derivatives using equations (6-12).

IX. Compute the VP spline integral estimate using Eq. (14).
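Steps I through VI, the mesh refinement part of the outline, can be sketched in Python as follows. This is a simplified illustration rather than a transcription of the appendix routine: the name `refine_mesh` and its parameters are hypothetical, all weights are recomputed on every pass instead of only the three affected ones, and the stopping test compares $w_i/(2(L+1)) = |u_i^*| + |v_i^*|$ with a fraction `tol` of the trapezoidal estimate. Steps VII to IX (setting the powers, solving for the slopes, and integrating the spline) are not included.

```python
def refine_mesh(f, x0, L=2.5, tol=1e-4, min_extra=0, max_extra=29):
    """Adaptive mesh refinement driven by the weights of Eq. (20).

    Returns the refined nodes, the integrand values, and the trapezoidal
    estimate of the integral on the final mesh.
    """
    x = list(x0)
    y = [f(t) for t in x]                       # step I

    def weight(i):                              # Eq. (20) at interior node i
        hl, hr = x[i] - x[i - 1], x[i + 1] - x[i]
        ql, qr = (y[i] - y[i - 1]) / hl, (y[i + 1] - y[i]) / hr
        return (hl * hl + hr * hr) * abs(qr - ql)

    extra = 0
    while True:
        trap = 0.5 * sum((x[i + 1] - x[i]) * (y[i] + y[i + 1])
                         for i in range(len(x) - 1))
        i_max = max(range(1, len(x) - 1), key=weight)   # steps II and III
        w_max = weight(i_max)
        # step IV: |u*| + |v*| = w / (2 (L + 1)), from Eqs. (18)-(20)
        met = w_max / (2.0 * (L + 1.0)) <= tol * abs(trap)
        if extra >= min_extra and (met or extra >= max_extra):
            return x, y, trap
        # step V: bisect the longer of the two subintervals meeting at i_max
        right_longer = x[i_max + 1] - x[i_max] >= x[i_max] - x[i_max - 1]
        j = i_max if right_longer else i_max - 1
        xm = 0.5 * (x[j] + x[j + 1])
        x.insert(j + 1, xm)                     # step VI
        y.insert(j + 1, f(xm))
        extra += 1
```

For example, `refine_mesh(lambda t: t * t, [0.0, 0.5, 1.0], tol=1e-12, max_extra=5)` adds five points to the three-point starting mesh, while a linear integrand has zero weights everywhere and is returned unrefined.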

6. EXAMPLE. The following integral containing a parameter
is considered here as a test case; it is obtained from a Weibull
probability density.

(21)  $\int_0^2 k(b)\, x^{b-1} e^{-x^b}\, dx = 1$

where $k(b) = b/(1 - e^{-2^b})$.

As b becomes large, the integrand will become a tall spike
centered near 1. The performance of VP spline adaptive integration will
be compared with that of 32 point Gauss-Legendre quadrature. It is
obvious that any quadrature formula using a constant mesh may be
defeated by this integral if b is chosen large enough. The purpose of
the comparison is therefore not to belabor this fact but to indicate
that the adaptive method is capable of handling even this pathological
case accurately and stably.

The following error table was computed for an L of 2.5. Only 32
functional evaluations were made for the VP spline integral estimates.

      b        VP          GAUSS

      2      .00012      -.00000000000000036
      4     -.000019     -.00000000000000014
      6      .00016       .0000000000060
      8      .00028       .00000030
     10      .00054      -.000043
     12      .00048       .00011
     14      .00014       .0032
     16      .00010       .0069
     18      .00041       .0034
     20      .00066      -.012
     22      .00013      -.04
     24      .000018     -.079
     26      .00048      -.13
     28      .00034      -.18
     30      .00037      -.23
     40      .00071      -.47
     50      .00019      -.60
     70      .00031      -.78
     90      .00043      -.90
    110      .0010       -.95
    130      .00074      -.98
    150      .00062      -.99

For well behaved integrands, Gaussian quadrature seems to be
unbeatable, as evidenced by the early entries in the table. The
Gaussian accuracy deteriorates, however, as its mesh becomes less
capable of detecting the spike. By the time b has reached a value of
150, Gaussian quadrature has "lost" 99% of the integral value.
Adaptive VP spline integration, although not as accurate as Gaussian for
small values of b, displays a uniform error pattern which is
independent of b over a considerable range.
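The behavior of the fixed-mesh rule in the GAUSS column can be reproduced with a small standard-library Python sketch. It builds the 32 point Gauss-Legendre rule by Newton iteration on the Legendre polynomial and applies it to the integrand of Eq. (21) on [0, 2]. The function names are hypothetical and the node construction is a textbook one, not taken from the paper; it is meant only to show why a constant mesh misses the spike for large b.

```python
import math

def gauss_legendre(npts):
    """Nodes and weights of the npts-point Gauss-Legendre rule on [-1, 1]."""
    nodes, weights = [], []
    for i in range(1, npts + 1):
        t = math.cos(math.pi * (i - 0.25) / (npts + 0.5))   # initial guess
        for _ in range(50):                                  # Newton iteration
            p_prev, p = 1.0, t                               # P_0(t), P_1(t)
            for k in range(2, npts + 1):                     # three-term recurrence
                p_prev, p = p, ((2 * k - 1) * t * p - (k - 1) * p_prev) / k
            dp = npts * (t * p - p_prev) / (t * t - 1.0)     # P_n'(t)
            step = p / dp
            t -= step
            if abs(step) < 1e-15:
                break
        nodes.append(t)
        weights.append(2.0 / ((1.0 - t * t) * dp * dp))
    return nodes, weights

def weibull_integrand(x, b):
    """Integrand of Eq. (21): k(b) x^(b-1) exp(-x^b)."""
    k = b / (1.0 - math.exp(-(2.0 ** b)))
    return k * x ** (b - 1) * math.exp(-(x ** b))

def gauss_error(b, npts=32):
    """Error (estimate minus the exact value 1) of the fixed rule on [0, 2]."""
    nodes, weights = gauss_legendre(npts)
    # map [-1, 1] onto [0, 2]: x = t + 1, dx = dt
    total = sum(w * weibull_integrand(t + 1.0, b) for t, w in zip(nodes, weights))
    return total - 1.0
```

Calling `gauss_error(2)` and `gauss_error(150)` shows the two extremes of the table: a smooth integrand is integrated essentially to machine precision, while for b = 150 the spike falls between the two central nodes and almost all of the integral is lost.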


Needless to say, a much better parametric study than has been
done here could be done for a variety of integrands. Fortran listings
of relevant subroutines are given here as an appendix for those
interested in using adaptive integration in a practical setting or for
those who might be able to do a more complete parametric study.

      SUBROUTINE VPSD (NPNTS,X,Y,D,M,N)
C     VPSD - NODAL DERIVATIVES OF A VP SPLINE, EQUATIONS (6-12)
C     NPNTS,X,Y=DATA
C     D=NODAL DERIVATIVES
C     M,N=VARIABLE POWERS
      IMPLICIT REAL*8 (A-H,O-Z)
      DIMENSION X(1), Y(1), D(1), M(1), N(1)
      DOUBLE PRECISION M, N, KI, KIM
      DIMENSION A(100), B(100), C(100)
      NP=NPNTS
      NM=NP-1
C     FIRST ROW, EQ. (6)
      C(1)=1.
      DXIM=X(2)-X(1)
      QIM=(Y(2)-Y(1))/DXIM
      KIM=1.-(M(1)-1.)*(N(1)-1.)
      B(1)=M(1)-1.
      D(1)=M(1)*QIM
C     DEFINE TRIDIAGONAL SYSTEM
      DO 1 I=2,NM
      DXI=X(I+1)-X(I)
      QI=(Y(I+1)-Y(I))/DXI
      KI=1.-(M(I)-1.)*(N(I)-1.)
      A(I)=M(I-1)*(M(I-1)-1.)/(KIM*DXIM)
      C(I)=N(I)*(N(I)-1.)/(KI*DXI)
      B(I)=(N(I-1)-1.)*A(I)+(M(I)-1.)*C(I)
      D(I)=N(I-1)*A(I)*QIM+M(I)*C(I)*QI
      DXIM=DXI
      QIM=QI
      KIM=KI
    1 CONTINUE
C     LAST ROW, EQ. (8)
      A(NP)=1.
      B(NP)=N(NM)-1.
      D(NP)=N(NM)*QI
C     REDUCE MATRIX BELOW THE DIAGONAL
      DO 2 I=1,NM
      Q=A(I+1)/B(I)
      B(I+1)=B(I+1)-Q*C(I)
      D(I+1)=D(I+1)-Q*D(I)
    2 CONTINUE
C     BACK SUBSTITUTE
      DO 3 J=1,NM
      I=NP-J
      D(I+1)=D(I+1)/B(I+1)
      D(I)=D(I)-C(I)*D(I+1)
    3 CONTINUE
      D(1)=D(1)/B(1)
      RETURN
      END


      SUBROUTINE ADVPSI (F,NPNTS,X,Y,MINEX,MAXEX,NTOT,YINT,TOL,L)
C     ADVPSI - ADAPTIVE VARIABLE POWER SPLINE INTEGRATION
C     F=INTEGRAND
C     NPNTS,X=INITIAL MESH
C     Y=INTEGRAND VALUES
C     MINEX,MAXEX=MINIMUM AND MAXIMUM NO. OF EXTRA POINTS TO BE GENERATED
C     NTOT=TOTAL NUMBER OF POINTS IN THE SAMPLE
C     YINT=DEFINITE INTEGRAL
C     ADVPSI MAY EASILY BE CHANGED TO YIELD THE INDEFINITE
C     INTEGRAL AS WELL
C     TOL=STOPPING TOLERANCE
C     IF TOLERANCE IS MET, TOL WILL BE GREATER THAN THE MAXIMUM LOCAL
C     INTEGRAL DISCREPANCY DIVIDED BY THE TRAPEZOIDAL ESTIMATE
C     THIS ROUTINE CALLS VPSD AND MNSET
C     NOTE - THE END OF THE ARGUMENT LIST AND A COMMON STATEMENT ARE
C     ILLEGIBLE IN THE SOURCE LISTING. PASSING TOL AND L AS ARGUMENTS
C     AND STARTING NEX AT ZERO ARE ASSUMPTIONS MADE HERE.
      IMPLICIT REAL*8(A-H,O-Z)
      DIMENSION X(1), Y(1)
      DIMENSION D(100), M(100), N(100), W(100)
      DOUBLE PRECISION M, N, L, KI
      TOLLP=TOL*(L+1.)
C     EVALUATE INTEGRAND OVER INITIAL MESH
      DO 1 I=1,NPNTS
    1 Y(I)=F(X(I))
      TIN=0.
      NEX=0
      NTOT=NPNTS
      NM=NTOT-1
C     COMPUTE TRAPEZOIDAL ESTIMATE OF INTEGRAL (TIN IS TWICE ITS VALUE)
      DO 2 I=1,NM
    2 TIN=TIN+(X(I+1)-X(I))*(Y(I)+Y(I+1))
      DXIM=X(2)-X(1)
      QIM=(Y(2)-Y(1))/DXIM
C     COMPUTE SIGNIFICANCE WEIGHTS FOR EACH NODE
      DO 3 I=2,NM
      DXI=X(I+1)-X(I)
      QI=(Y(I+1)-Y(I))/DXI
      W(I)=(DXIM**2+DXI**2)*DABS(QI-QIM)
      DXIM=DXI
      QIM=QI
    3 CONTINUE
C     FIND MOST SIGNIFICANT NODE
    4 WMX=W(2)
      IMX=2
      DO 7 I=2,NM
      IF (WMX-W(I)) 5,6,6
    5 WMX=W(I)
      IMX=I
    6 CONTINUE
    7 CONTINUE
C     CHECK FOR EXIT
      IF (NEX-MINEX) 10,8,8
    8 IF (WMX-TOLLP*DABS(TIN)) 21,21,9
    9 IF (NEX-MAXEX) 10,21,21
   10 CONTINUE
C     ISI=INDEX OF MOST SIGNIFICANT SUBINTERVAL
C     (TEST RECONSTRUCTED FROM STEP V OF THE ALGORITHM)
      ISI=IMX
      IF ((X(IMX+1)-X(IMX))-(X(IMX)-X(IMX-1))) 11,12,12
   11 ISI=IMX-1
   12 CONTINUE
C     COMPUTE POINT TO BE INSERTED
      XI=(X(ISI)+X(ISI+1))/2.
      YI=F(XI)
C     SHIFT ARRAYS TO MAKE ROOM FOR THE NEW POINT
      K=NTOT
      NSH=NTOT-ISI
      DO 13 I=1,NSH
      X(K+1)=X(K)
      Y(K+1)=Y(K)
      W(K+1)=W(K)
      K=K-1
   13 CONTINUE
      I=ISI+1
      X(I)=XI
      Y(I)=YI
      NM=NTOT
      NEX=NEX+1
      NTOT=NTOT+1
C     RECALCULATE NEIGHBORING WEIGHTS AND COMPUTE ADDITIONAL
C     CONTRIBUTION TO TRAPEZOIDAL ESTIMATE OF INTEGRAL
      IMID=I
      I1=I-1
      I2=I+1
      IF (I1-1) 14,14,15
   14 I1=2
   15 IF (I2-NTOT) 17,16,16
   16 I2=NM
   17 DXIM=X(I1)-X(I1-1)
      QIM=(Y(I1)-Y(I1-1))/DXIM
      DO 20 I=I1,I2
      DXI=X(I+1)-X(I)
      QI=(Y(I+1)-Y(I))/DXI
      W(I)=(DXIM**2+DXI**2)*DABS(QI-QIM)
      IF (I-IMID) 19,18,19
   18 TL=DXIM*(Y(I-1)+Y(I))
      TR=DXI*(Y(I)+Y(I+1))
      TM=(DXIM+DXI)*(Y(I-1)+Y(I+1))
      TIN=TIN+(TL+TR-TM)
   19 DXIM=DXI
      QIM=QI
   20 CONTINUE
      GO TO 4
C     SET M'S AND N'S
   21 CALL MNSET (NTOT,X,Y,M,N,L)
C     COMPUTE NODAL DERIVATIVES
      CALL VPSD (NTOT,X,Y,D,M,N)
C     COMPUTE INTEGRAL OF VP SPLINE
C     (KI, E1 AND E2 ARE RECONSTRUCTED FROM EQS. (4), (5) AND (14);
C     THE ORIGINAL CARDS ARE ILLEGIBLE)
      YINT=0.
      DO 22 I=1,NM
      DXI=X(I+1)-X(I)
      QI=(Y(I+1)-Y(I))/DXI
      T=Y(I)+Y(I+1)
      KI=1.-(M(I)-1.)*(N(I)-1.)
      E1=-(M(I)-1.)*(N(I)+1.)*(N(I)*QI-D(I)-(N(I)-1.)*D(I+1))
      E2=-(N(I)-1.)*(M(I)+1.)*(-M(I)*QI+(M(I)-1.)*D(I)+D(I+1))
      E3=DXI/(KI*(M(I)+1.)*(N(I)+1.))
      CI=DXI*(T+E3*(E1+E2))
      YINT=YINT+CI
   22 CONTINUE
      YINT=YINT/2.
      RETURN
      END


      SUBROUTINE MNSET (NPNTS,X,Y,M,N,L)
C     MNSET - MNSET SETS THE M'S AND N'S FOR A VP SPLINE
C     NPNTS,X,Y=DATA
C     M,N=VARIABLE POWERS FOR A VP SPLINE
C     L=LOWER BOUND ON M AND N
C     THE POWERS FOLLOW EQ. (13), IN WHICH THE NODAL SLOPE MAKES
C     EQUAL ANGLES WITH THE LINEAR INTERPOLATOR ON
C     THE LEFT AND RIGHT OF THE POINT
      IMPLICIT REAL*8 (A-H,O-Z)
      DIMENSION X(1), Y(1), M(1), N(1)
      DOUBLE PRECISION M, N, L
      NM=NPNTS-1
      N(1)=L
      M(NM)=L
      DXIM=X(2)-X(1)
      QIM=(Y(2)-Y(1))/DXIM
      DO 4 I=2,NM
      DXI=X(I+1)-X(I)
      QI=(Y(I+1)-Y(I))/DXI
      R=(DXI/DXIM)*DSQRT((1.+QIM**2)/(1.+QI**2))
      IF (R-1.) 1,1,2
    1 N(I)=L
      M(I-1)=L/R
      GO TO 3
    2 M(I-1)=L
      N(I)=L*R
    3 DXIM=DXI
      QIM=QI
    4 CONTINUE
      RETURN
      END


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TIME  EVOLUTION  OF  M  ORTHOGONAL  MATRIX 
James  M.  NUkes 

Army  Materiel  Test  and  Evaluation  Directorate 
White  Sands  Missile  Range,  NM 


ABSTRACT. The usual method of computing a rotation matrix as a function of
the Euler angles is discussed. On a digital computer these angles must be
obtained by a numerical integration of the angle derivatives, which are
functions of the angular velocity components of the rotating coordinate system.
The numerical integration in effect imposes a rotational motion with constant
angular velocity over a time interval of length equal to the integration
step-size. This constancy of the angular velocity is exploited to formulate
a simple second-order differential equation for the orthogonal matrix describing
the rotation. The equation is easily solved exactly, and gives an expression
for the matrix at the end of an integration interval as a function of the matrix,
and of the angular velocity components, at the beginning of the interval. The
second method avoids some of the difficulties of the Euler angle method, and can
be usefully applied in digital simulations of rigid-body motion.

1. INTRODUCTION. A mathematical model of the motion of a rigid body requires
information regarding the relation between two cartesian coordinate systems,
one of which is rotating with respect to the other. This information is contained
in the nine elements of the matrix R describing the change of basis from one
coordinate system to the other. The physical requirement that the magnitude of
a vector be invariant under a change of basis due to a rotation imposes the
following mathematical condition [1] on R:

$R R^T = I = R^T R$ ,                                                  (1.1)

where I is the identity matrix, and the T-superscript denotes the matrix transpose.
This condition is referred to as the orthogonality condition, and R is said to be
an orthogonal matrix.


Equation (1.1) represents nine equations in the nine elements of R,
which would uniquely determine those elements but for the fact that $R R^T = I = R^T R$
is a symmetric matrix. Due to this symmetry, only six of the equations are
independent. The three undetermined elements serve to parameterize the
(infinite number of) different rotation matrices, and the set of all such matrices
constitutes the three parameter group of orthogonal matrices.

A popular choice for the parameters is a set of three angular coordinates
$\theta_1$, $\theta_2$, and $\theta_3$, known as the Euler angles [2]. With this choice the matrix R
can be written as a product of three separate rotations, through each of the three
Euler angles. At least two potential difficulties accompany this parameterization.
The first is a matter of economy of computation. Once the values of the Euler
angles have been determined, one still must compute the matrix elements of R as
sums and products of trigonometric functions of the angles. Such computations
can become very time-consuming, and therefore expensive, on a digital computer.
The second problem is of a mathematical nature. It can be shown that for a given
sequence of Euler rotations, the angular velocity components $\omega_i$, i = 1,2,3, in
the rotating basis, can be expressed as linear functions of the Euler angle
derivatives $\dot\theta_i$, i = 1,2,3. That is, at any time t, one has relations of the
following form:

$\omega_i(t) = \sum_j G_{ij}(\theta_2(t), \theta_3(t))\, \dot\theta_j(t)$ ,   i = 1,2,3,        (1.2)

where all summations are understood to be from 1 to 3, on repeated indices of the
summand. (The coefficient matrix G depends, in general, only upon the last two
rotation angles of the rotation sequence.) To determine the angles, one must
first solve (1.2) for the derivatives of the angles, and then integrate these
derivatives. The solution of (1.2) for the derivatives involves inverting the
matrix G. However, for certain values of the Euler angles, the determinant of
G vanishes, hence $G^{-1}$ does not exist, and the Euler angle method fails for those
values of the angles.

The following observations are important for developing an alternate
method of computing a rotation matrix. In a digital model all integrations
are performed numerically. Typically, a numerical method requires for the
computation of the value of a variable the previously calculated value of the
variable and its derivative. For illustrative purposes, consider a numerical
integration based on a first-order Taylor's series. Assuming the values
$\theta_i(0)$ and $\dot\theta_i(0)$, i = 1,2,3, to have been computed at the beginning of an
integration interval (which we take for convenience to be t = 0), this method
computes the following values for the Euler angles at the end of an integration
interval of step-size $\tau$:

$\theta_i(\tau) = \theta_i(0) + \tau\,\dot\theta_i(0)$ ,   i = 1,2,3.                         (1.3)

For values of the Euler angles for which the coefficient matrix G in (1.2) is
non-singular, we find from (1.2):

$\dot\theta_i(0) = \sum_j G^{-1}_{ij}(\theta_2(0), \theta_3(0))\, \omega_j(0)$ ,   i = 1,2,3.       (1.4)

Substituting (1.4) into (1.3) then yields for the new values of the angles

$\theta_i(\tau) = \theta_i(0) + \tau \sum_j G^{-1}_{ij}(\theta_2(0), \theta_3(0))\, \omega_j(0)$ ,   i = 1,2,3.   (1.5)

In (1.5) the angular velocity dependence of the new values involves only the
previous values $\omega_j(0)$. Since the elements of $R(\tau)$ can be constructed as functions
of the $\theta_i(\tau)$, the values $\omega_j(0)$ are the best values of the angular velocity
components available for computing $R(\tau)$. Hence, for digital computation purposes
the angular velocity components can be considered to have the constant values
$\omega_j(0)$ on time intervals equal in length to the integration step-size, that is,
for all $t \in [0, \tau]$.


This constancy of the angular velocity on integration intervals allows us to
formulate and solve a simple second-order differential equation for R. The solution
allows a direct computation of $R(\tau)$ as a function of the initial matrix R(0),
and the angular velocity components $\omega_j(0)$, j = 1,2,3. For the case R(0) = I
(that is, when the two coordinate systems initially coincide), the result is the
expression [3,4] for the matrix describing rotations about an arbitrary
fixed axis. Although the method we describe is thus fairly well-known (it was in
fact developed for, and is being successfully applied in, a large digital missile
simulation [5]), the derivation given in Section 3 is believed to be new and, in
our opinion, much more straightforward than the geometrical arguments given in
the usual derivations [3,4].


2. SOME PROPERTIES OF ANTISYMMETRIC MATRICES. By definition, an antisymmetric
matrix A is a square matrix satisfying the identity $A^T = -A$. From this identity
one can easily deduce the following general form for a 3x3 antisymmetric matrix:

$A = \begin{pmatrix} 0 & a_3 & -a_2 \\ -a_3 & 0 & a_1 \\ a_2 & -a_1 & 0 \end{pmatrix}$ .                    (2.1)

Introducing the Levi-Civita permutation symbol $\epsilon_{ijk}$ ($\epsilon_{123} = 1$, $\epsilon_{ijk} = 1\ (-1)$ for
even (odd) permutations of 1,2,3, and $\epsilon_{ijk} = 0$ if any two indices are the same),
the matrix elements of A can be written concisely as

$A_{ij} = \sum_k \epsilon_{ijk}\, a_k$ ,   i,j = 1,2,3.                                (2.2)

By taking the product of A with itself, we obtain the matrix elements of $A^2$ in the
form

$(A^2)_{ij} = -a^2 \delta_{ij} + a_i a_j$ ,                                       (2.3)

where $\delta_{ij}$ is the Kronecker delta symbol ($\delta_{ij} = 1$ if i = j, and $\delta_{ij} = 0$ if $i \ne j$),
and where $a^2 = a_1^2 + a_2^2 + a_3^2$.

Defining a symmetric matrix S(a) by

$S_{ij}(a) = a_i a_j$ ,                                                 (2.4)

one can write $A^2$ as

$A^2 = -a^2 I + S(a)$ .                                              (2.5)

It is easy to show, using (2.2) and (2.4), that AS(a) = 0; hence, multiplying
both sides of (2.5) by A gives the very useful identity:

$A^3 = -a^2 A$ .                                                    (2.6)
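The identities (2.5) and (2.6) are easy to confirm numerically. The following Python sketch builds A with the sign convention $A_{12} = a_3$ (an assumption consistent with (2.2); the identities hold equally for the opposite sign) and checks them in exact integer arithmetic for one sample vector. All function names are illustrative.

```python
def antisym(a1, a2, a3):
    """3x3 antisymmetric matrix with A_ij = sum_k eps_ijk a_k."""
    return [[0, a3, -a2],
            [-a3, 0, a1],
            [a2, -a1, 0]]

def matmul(P, Q):
    """Product of two 3x3 matrices stored as nested lists."""
    return [[sum(P[i][k] * Q[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def outer(a):
    """Symmetric matrix S(a) with elements a_i a_j, Eq. (2.4)."""
    return [[ai * aj for aj in a] for ai in a]

a = (1, 2, 3)
a_sq = sum(v * v for v in a)          # a^2 = a_1^2 + a_2^2 + a_3^2
A = antisym(*a)
A2 = matmul(A, A)
A3 = matmul(A2, A)
S = outer(a)

# Eq. (2.5): A^2 = -a^2 I + S(a)
rhs_25 = [[S[i][j] - (a_sq if i == j else 0) for j in range(3)] for i in range(3)]
# Eq. (2.6): A^3 = -a^2 A
rhs_26 = [[-a_sq * A[i][j] for j in range(3)] for i in range(3)]
```

Comparing `A2` with `rhs_25`, `matmul(A, S)` with the zero matrix, and `A3` with `rhs_26` confirms each step of the argument for this choice of a.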


3. THE DIFFERENTIAL EQUATION FOR R. Assuming the elements of R to be
differentiable functions of time on the interval $[0, \tau]$, we differentiate both sides of
(1.1) to obtain

$\dot R(t)\, R^T(t) + R(t)\, \dot R^T(t) = 0$ ,                                 (3.1)

where $\dot R$ is the matrix containing the derivatives of the elements of R, and we
note that $\dot I = 0$. Defining a new matrix $\Omega$ by

$\Omega(t) = \dot R(t)\, R^T(t)$ ,                                            (3.2)

we obtain from (3.2) and (3.1), and the identity $(AB)^T = B^T A^T$:

$\Omega^T(t) = R(t)\, \dot R^T(t) = -\dot R(t)\, R^T(t) = -\Omega(t)$ ,

and it follows that the matrix $\Omega$ is antisymmetric. By (2.1) $\Omega$ can be written
in the general form

$\Omega(t) = \begin{pmatrix} 0 & \omega_3(t) & -\omega_2(t) \\ -\omega_3(t) & 0 & \omega_1(t) \\ \omega_2(t) & -\omega_1(t) & 0 \end{pmatrix}$ .          (3.3)

It is demonstrated in several textbooks [6,7] that the elements of $\Omega$, defined
by (3.2), can be identified with the components in the rotating basis of the
angular velocity vector. As discussed in the Introduction, the best available
values of these components on the interval $[0, \tau]$ are the previously computed
values $\omega_j(0)$.

Setting

$$\Omega = \Omega(0), \qquad \omega_j = \omega_j(0), \quad j = 1,2,3, \qquad (3.4)$$

and multiplying both sides of (3.2) by R(t) from the right, using the orthogonality
condition (1.1), we obtain the following first-order differential equation for R:

$$\dot R(t) = \Omega R(t). \qquad (3.5)$$

Since $\Omega$ is a constant matrix on $[0,\tau]$, (3.5) can be differentiated to yield:

$$\ddot R(t) = \Omega \dot R(t) = \Omega^2 R(t), \qquad (3.6)$$

where $\dot R(t)$ has been replaced by (3.5) in the last equation of (3.6). Multiplying
(3.6) by $\Omega$ now gives

$$\Omega \ddot R(t) - \Omega^3 R(t) = \Omega \ddot R(t) + \omega^2 \Omega R(t) = 0, \qquad (3.7)$$

where we have used (2.6) for $\Omega^3$, and where

$$\omega^2 = \omega_1^2 + \omega_2^2 + \omega_3^2. \qquad (3.8)$$

Since $\omega^2$ is a scalar, it commutes with $\Omega$, and (3.7) can be written as

$$\Omega [\ddot R(t) + \omega^2 R(t)] = \Omega C_0 = 0, \qquad (3.9)$$

where we have defined

$$\ddot R(t) + \omega^2 R(t) = C_0. \qquad (3.10)$$

Equation (3.10) is the familiar equation for a forced harmonic oscillator,
except that the "dependent variable" is here a matrix function R, and the "forcing
function" is an as yet undetermined matrix $C_0$. It is easy to show, using (3.5),
(3.6), and (2.6), that $\dot C_0 = 0$, so that $C_0$ is in fact a constant matrix.
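For completeness, the constancy of the forcing matrix can be written out in one line. This is a sketch of the omitted step, assuming the notation $C_0 = \ddot R + \omega^2 R$ for the matrix defined in (3.10) and a constant $\Omega$ on the interval:

```latex
\dot C_0
  = \dddot R(t) + \omega^2 \dot R(t)
  = \Omega^3 R(t) + \omega^2 \Omega R(t)   % repeated use of (3.5), (3.6)
  = \left(-\omega^2 \Omega + \omega^2 \Omega\right) R(t)   % identity (2.6)
  = 0 .
```

The same computation shows that $C_0 = (\Omega^2 + \omega^2 I) R(t)$, which by (2.5) equals $S(\omega) R(t)$ with $S_{ij}(\omega) = \omega_i \omega_j$.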

Furthermore, using (2.5), one can show that $C_0 = 0$ implies that $\Omega = 0$, which,
from (3.5), corresponds to the trivial solution R(t) = R(0), $t \in [0,\tau]$. By direct
substitution one can then verify that the non-trivial solutions of (3.10) have
the general form:

$$R(t) = C_0/\omega^2 + C_1 \sin\omega t + C_2 \cos\omega t, \qquad (3.11)$$

where $C_1$ and $C_2$ are arbitrary constant matrices. To determine the constant
matrices in (3.11), we evaluate R and its first two derivatives (found by
differentiating (3.11)) at t = 0, and compare the results with (3.5) and (3.6)
evaluated at t = 0. The results are

$$C_0/\omega^2 = R(0) + \Omega^2 R(0)/\omega^2,$$

$$C_1 = \Omega R(0)/\omega,$$

$$C_2 = -\Omega^2 R(0)/\omega^2.$$


Substituting these expressions into (3.11), we obtain the following solution
for the rotation matrix at time $t = \tau$:

$$R(\tau) = [\, I + (\Omega/\omega) \sin\omega\tau + (\Omega^2/\omega^2)(1 - \cos\omega\tau) \,]\, R(0). \qquad (3.12)$$

It is convenient to define a "transition" matrix X by

$$X(\tau) = I + (\Omega/\omega) \sin\omega\tau + (\Omega^2/\omega^2)(1 - \cos\omega\tau). \qquad (3.13)$$

If the matrix R(0) is known, then $X(\tau)$ defines the transition over the interval
of length $\tau$ to the new matrix

$$R(\tau) = X(\tau) R(0). \qquad (3.14)$$

If the two coordinate systems initially coincide, so that R(0) = I, then $R(\tau) = X(\tau)$.
Using (2.2) and (2.3) in (3.13), we obtain the matrix elements of X in the form

$$X_{ij}(\tau) = \delta_{ij} \cos\omega\tau + \sum_{k} \epsilon_{ijk} (\omega_k/\omega) \sin\omega\tau + (\omega_i \omega_j/\omega^2)(1 - \cos\omega\tau),$$

which is a slightly simplified form of equation (19) of Ref. 4 for the elements
of the matrix describing a rotation through the angle $\omega\tau$, about an axis defined
by the direction cosines $\omega_i/\omega$, i = 1,2,3.
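The transition matrix lends itself to a very short implementation. The following is a minimal sketch, assuming the closed form $X(\tau) = I + (\Omega/\omega)\sin\omega\tau + (\Omega^2/\omega^2)(1-\cos\omega\tau)$ with $\Omega$ built from constant components $\omega_1, \omega_2, \omega_3$ as in (3.3); the function names and the sample angular velocity are hypothetical.

```python
import math

def matmul(P, Q):
    return [[sum(P[i][k] * Q[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def transpose(P):
    return [[P[j][i] for j in range(3)] for i in range(3)]

def max_abs_diff(P, Q):
    return max(abs(P[i][j] - Q[i][j]) for i in range(3) for j in range(3))

IDENTITY = [[1.0 if i == j else 0.0 for j in range(3)] for i in range(3)]

def transition_matrix(omega, tau):
    """X(tau) for a constant angular velocity (w1, w2, w3) over a step tau."""
    w1, w2, w3 = omega
    w = math.sqrt(w1 * w1 + w2 * w2 + w3 * w3)
    if w == 0.0:
        return [row[:] for row in IDENTITY]   # trivial solution R(t) = R(0)
    Om = [[0.0, w3, -w2], [-w3, 0.0, w1], [w2, -w1, 0.0]]
    Om2 = matmul(Om, Om)
    s = math.sin(w * tau) / w                  # only sin and cos are needed
    c = (1.0 - math.cos(w * tau)) / (w * w)
    return [[IDENTITY[i][j] + s * Om[i][j] + c * Om2[i][j] for j in range(3)]
            for i in range(3)]

omega = (0.2, -0.5, 1.1)   # sample angular velocity components
tau = 0.05                 # sample step length
X = transition_matrix(omega, tau)

# X must be orthogonal, and two half steps must compose to one full step.
orth_err = max_abs_diff(matmul(X, transpose(X)), IDENTITY)
X_half = transition_matrix(omega, tau / 2.0)
compose_err = max_abs_diff(matmul(X_half, X_half), X)

# Rotation about the 3-axis reduces to the familiar planar form.
Xz = transition_matrix((0.0, 0.0, 2.0), 0.3)
print(orth_err, compose_err, Xz[0][0], Xz[0][1])
```

Updating the attitude over successive intervals then amounts to repeated multiplication, R_new = X R_old, with the angular velocity refreshed at the start of each interval.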

4. CONCLUSION. The transition matrix method described in this paper eliminates
the inversion singularity problem of the Euler angle method, as well as the
numerical integration of the Euler angle derivatives required by that method. Also,
the only trigonometric functions to be computed in (3.12) are $\sin\omega\tau$ and $\cos\omega\tau$,
hence computation time should be reduced by the transition matrix method. If so
desired, the Euler angles can be recovered at any time from the rotation matrix,
for they are simply inverse trigonometric functions of the matrix elements. We
remark that (3.12) is approximately valid on any interval for which the angular
velocity is approximately constant, that is, on any interval where the angular
acceleration is "small". It would appear that this formalism has significant
advantages over the usual Euler angle method.


ABSTRACT. Fundamental fields and weight functions are presented in closed
form by algorithm and formula.

1. INTRODUCTION. STATES OF PLANE STRAIN. During the last three decades
the analysis of stress fields near the edges of cracks has grown into a discipline
of its own. Various methods for the computation of stress intensity
factors have been developed. The use of weight functions is one of them. Originally
proposed for states of plane strain [1], the method can be extended to
three-dimensional fields [2, 3, 4]. In the sequel we shall do this for the
configurations of the penny-shaped and of the elliptic crack. The analysis is within
the frame of the classical theory of elasticity. Using a rectangular cartesian
coordinate system x,y,z we denote the respective displacements by u,v,w and
the stresses by $\sigma_x$, $\tau_{xy}$, etc. in the familiar manner. It is useful to begin with
a review of states of plane strain within a cylindrical elastic body V with
generators parallel to the z-axis. Figure 1 shows its cross-section in the
(x,y)-plane. V has mirror symmetry with respect to the (x,z)-plane. In the
same plane a crack with faces $C^+$, $C^-$ extends from the z-axis in the direction of
the negative x-axis. The boundary of V consists of the crack faces and of a
cylindrical surface B. Let B be attacked by a load of tractions, the latter
acting with components X,Y in x- and y-direction respectively and with X,Y the
same along a generator. Assuming mirror symmetry of the distribution of tractions
with respect to the (x,z)-plane and imposing the constraint w = 0 we obtain a
state of deformation in V where u,v do not depend on z (plane strain) and where a
suitable disposition of rigid body motion makes u an even and v an odd function
of y (mode I). Let $x = r\cos\theta$, $y = r\sin\theta$ define polar coordinates $r, \theta$. With
their aid the asymptotic behavior near r = 0 of the relevant field quantities can
be described as follows:


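Figure 1. Cross-section of V in the (x,y)-plane (plane strain, mode I).

$$\sigma = \frac{k}{\sqrt{2r}}\, f(\theta) \cos\tfrac{1}{2}\theta$$

with a suitable constant k and where

$$f(\theta) = \begin{cases} 1 - \sin\tfrac{1}{2}\theta \,\sin\tfrac{3}{2}\theta & \text{for } \sigma = \sigma_x \\ 1 + \sin\tfrac{1}{2}\theta \,\sin\tfrac{3}{2}\theta & \text{for } \sigma = \sigma_y \\ \sin\tfrac{1}{2}\theta \,\cos\tfrac{3}{2}\theta & \text{for } \sigma = \tau_{xy} \end{cases} \qquad (1.1)$$

furthermore

$$u = \frac{k}{4\mu} \sqrt{2r}\, (\kappa - \cos\theta) \cos\tfrac{1}{2}\theta, \qquad v = \frac{k}{4\mu} \sqrt{2r}\, (\kappa - \cos\theta) \sin\tfrac{1}{2}\theta, \qquad (1.2)$$

$\kappa = 3 - 4\nu$, $\nu$ = Poisson's ratio, $\mu$ = shear modulus.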


The constant k is known as stress intensity factor. The asymptotic relations
(1.1), (1.2) stay valid if a bounded and smooth distribution of tractions on
$C^+$, $C^-$ is admitted in accord with the symmetry of mode I. It is customary to
consider the term $r^{-1/2}$ in (1.1) as a point singularity in the (x,y)-plane at the
"crack tip" r = 0. Nevertheless the singularity is along the whole z-axis as a
singular line (the edge of the crack). This should be kept in mind.

Although the stresses are unbounded near r = 0 the energy of deformation
per unit length in z-direction is bounded in general. More precisely it is
bounded within any cylinder $r = r_0$ of sufficiently small radius $r_0$. If
unbounded the cause is not asymptotic behavior in accord with (1.1) but singular
behavior of the stress field at points $r \neq 0$ of load application. The latter
happens for concentrated loads. If B is smooth and if the tractions are
bounded and smoothly distributed then the energy per unit length is bounded. In
practical mechanics no other situations are encountered. The singular behavior
(1.1) of the stresses notwithstanding, we are justified in denoting the field
responding to the applied tractions as a regular field.


Let now a field of plane strain and of mode I have the property that

$$u, v = O(r^{-1/2}), \qquad \sigma = O(r^{-3/2}) \quad \text{near } r = 0. \qquad (1.3)$$

We shall call such a field fundamental if it goes without body forces and if it
displays no surface tractions. It is not difficult to construct such a field.
Let $t \neq 0$ be an arbitrary constant. We set up

$$u = u_s + u_r, \qquad v = v_s + v_r,$$

where

$$u_s = t\, r^{-1/2} \left( \tfrac{1}{2} \cos\tfrac{5}{2}\theta + (\kappa - \tfrac{3}{2}) \cos\tfrac{1}{2}\theta \right),$$

$$v_s = t\, r^{-1/2} \left( \tfrac{1}{2} \sin\tfrac{5}{2}\theta - (\kappa + \tfrac{3}{2}) \sin\tfrac{1}{2}\theta \right), \qquad (1.4)$$

and where $u_r, v_r$ are the displacements u,v of a suitably chosen regular field.
It so happens that the displacements $u_s, v_s$ create a stress field without body
forces: no tractions are induced on $C^+$, $C^-$ while a system of self-equilibrated
tractions shows up on B. We choose $u_r, v_r$ so as to compensate the tractions on
B. This establishes u,v by (1.4) as the displacements of a fundamental field.
The asymptotic laws (1.3) can be rewritten in the vein of (1.2), (1.1). The
details follow from the explicit form of $u_s, v_s$ in (1.4). It has been shown in
[1] that the construction (1.4) yields all fundamental fields in V of mode I.
The energy of deformation per unit length is infinite. More precisely the
energy is already infinite within any cylinder $r = r_0$, no matter how small
$r_0 > 0$. We can dispose of t by normalizing the fundamental field. If $t(\kappa + 1) = -1$
then

$$v = |x|^{-1/2} \ \text{on } C^+, \qquad v = -|x|^{-1/2} \ \text{on } C^- \qquad (1.5)$$

near x = 0.


We shall write $u = u_f$, $v = v_f$ if the fundamental field is normalized by
(1.5). Setting $u = u_r$, $v = v_r$ we shall characterize a generic regular field, i.e.
the meaning of $u_r, v_r$ will not be restricted to (1.4). Let us now consider the
mixed energy of deformation (per unit length) W associated with $u_f, v_f$ and $u_r, v_r$.
To be on the safe side we exclude the cylindrical domain $r < r_0$ from V. In the
remaining portion the mixed energy can be assumed to exist. By Betti's theorem
two representations $W = W_{rf}$, $W = W_{fr}$ of the mixed energy are available. Here
$W_{rf}$ is the work of the tractions of the regular field through the displacements
of the fundamental one; $W_{fr}$ is the work of the tractions of the fundamental
field through the displacements of the regular one. In either case the tractions
on the cylinder $r = r_0$ must be taken into account. We can write


$$-W'_{rf} + W'_{fr} = W''_{rf} - W''_{fr} \qquad (1.6)$$


where primes refer to the cylinder $r = r_0$ and double primes to the boundary of
V outside that cylinder; the latter includes B and part of $C^+$, $C^-$. Since the
fundamental field exhibits no tractions on B, $C^+$, $C^-$ we find $W''_{fr} = 0$. For
sufficiently small $r_0$ the left-hand side of (1.6) can be evaluated with the aid of the
asymptotic relations (1.1), (1.2) for the regular field and (1.3), (1.4) for the
fundamental field. In this context (1.4) must be supplemented by formulas for
the stresses to which $u_s, v_s$ give rise. Without going into any further detail
we observe that specified stresses $\sigma_r, \sigma_f$ and displacements $w_r, w_f$ of regular and
fundamental field respectively obey the order relations

$$r_0\, \sigma_r \cdot w_f \sim r_0\, \sigma_f \cdot w_r = O(1) \quad \text{as } r_0 \to 0 \qquad (1.7)$$

on the cylinder $r = r_0$. Since $W'_{rf}$ and $W'_{fr}$ are representable as line integrals
over the circle $r = r_0$ the asymptotic relations determine the left-hand side of
(1.6) in the limit $r_0 \to 0$. The final result of this procedure is

" '  ^ *  Vf  ^  (1.8) 

£ 


for the stress intensity factor k of the regular field. $X_r, Y_r$ are the components
of traction of that very field, and the integration in (1.8) is over the line $\mathcal{L}$
which bounds the cross section of V in the (x,y)-plane, ds being the length
element of $\mathcal{L}$. $\mathcal{L}$ is the projection of B as well as of $C^+$, $C^-$ onto that plane.

Details of the derivation of (1.8) can be found in [1,2]; a different derivation
is in [3]. It is possible to extend (1.8) to regular fields with body forces.

In the special case that the tractions appear exclusively on $C^+$, $C^-$ in the form
of a pressure distribution formula (1.8) specializes into

^  f  ro(s)p(s)ds;  £*■  =  projection  of  C'*' 

£+ 


(1.8-) 


p  =  applied  pressure. 


m  =  v^  on  C 


We call the displacements $u_f, v_f$ weight functions. They permit to represent k
as a weighted sum of the tractions $X_r, Y_r$. The use of a formula of type (1.8)
for the computation of the stress intensity factor k is advantageous in two
respects:

(1) $u_f, v_f$ depend exclusively on the shape of V; thus geometry and loading
appear independently in (1.8).

(2) the effort to calculate $u_f, v_f$ is not higher than the effort to calculate
the displacements of some regular field.

According to (1.8') the function m(s') can be interpreted, up to a constant
factor, as the stress intensity factor of a regular field responding to the
concentrated pressure

$$p(s) = \delta(s - s'),$$

where $\delta(\ldots)$ is Dirac's delta function. For this reason one could be inclined
to classify m(s) as Green's function. Unfortunately the interpretation makes
the function m(s) an abstract from infinitely many fields, each characterized
by a different point s' of load concentration. To compute m that way would
sacrifice the advantage (2) which rests on the circumstance that m(s) is a
boundary displacement of only one field. The term "weight function" was chosen
in order to avoid the misleading suggestions associated with the concept of
Green's function.

For some plane strain configurations of mode I in which the crack faces
alone are loaded by some pressure distribution p(s) integral equations have
been found [2] which link p(s) to the crack opening displacement $v(s) = v_r$ on $C^+$
in the form

f  °  4k^iJ  • 

a 

The interval (a,b) is identical with $\mathcal{L}^+$; L(s,t) is a Cauchy type singular
integral operator. The integral is taken as Cauchy principal value. An example

-1 


for the Griffith crack ($-1 < x < 0$; y = 0) in an infinite solid. The
homogeneous case p(s) = 0 admits the solution $q(t) \equiv 0$ only if one insists that
q(t) be bounded. If one drops this condition then q(t) = cm(t) with c as
constant coefficient becomes a solution. For the Griffith crack the homogeneous
equation admits two solutions associated with the crack tips x = 0, x = -1,
namely


(l.U) 


2. FIELDS IN THREE DIMENSIONS. Let us now generalize the states of plane
strain of mode I into states of three dimensions. We shall assume a plane
crack in the (x,y)-plane. Figure 2a shows an elliptic crack as example. The
faces are denoted by $C^+$, $C^-$ and the contour by C. In Figure 2b an infinite
crack occupying the half-plane x < 0 is represented. We shall assume that the
displacement field has mirror symmetry with respect to the (x,y)-plane; more
precisely u,v are to be the same at points (x,y,z) and (x,y,-z) while w changes
sign without change of absolute value. This is the generalization of mode I of
plane strain. Finally we confine the attention to those fields which can be
derived from a Boussinesq-Papkovich potential G(x,y,z). This potential is
harmonic, i.e.

$$\nabla^2 G = G_{xx} + G_{yy} + G_{zz} = 0. \qquad (2.1)$$

Figure 2a

Figure 2b

Here  and  in  what  follows  coordinate-denoting  subscripts  indicate  partial 
derivatives.  The  displacements  and  stresses  are  derived  as  follows: 

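$$u = -zG_{xz} - (1-2\nu)G_x, \qquad v = -zG_{yz} - (1-2\nu)G_y, \qquad w = -zG_{zz} + 2(1-\nu)G_z, \qquad (2.2)$$

$$\sigma_x = -2\mu\,[\,zG_{xxz} + G_{xx} + 2\nu G_{yy}\,], \qquad \sigma_y = -2\mu\,[\,zG_{yyz} + G_{yy} + 2\nu G_{xx}\,],$$

$$\tau_{xy} = -2\mu\,[\,zG_{xyz} + (1-2\nu)G_{xy}\,], \qquad \tau_{zx} = -2\mu\, zG_{xzz}, \qquad \tau_{yz} = -2\mu\, zG_{yzz},$$

$$\sigma_z = -2\mu\,[\,zG_{zzz} - G_{zz}\,]. \qquad (2.3)$$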
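As a worked check of how the stresses follow from the displacements, the normal stress $\sigma_z$ and the shear stress $\tau_{zx}$ can be derived in a few lines. The sketch assumes the displacement representation $u = -zG_{xz} - (1-2\nu)G_x$, $v = -zG_{yz} - (1-2\nu)G_y$, $w = -zG_{zz} + 2(1-\nu)G_z$, the harmonicity of G, and Hooke's law of the isotropic solid in the form $\sigma_z = 2\mu\,[\,w_z + \nu e/(1-2\nu)\,]$ with the dilatation $e = u_x + v_y + w_z$:

```latex
\begin{aligned}
w_z &= -zG_{zzz} + (1-2\nu)\,G_{zz}, \\
e   &= -z\,(G_{xx}+G_{yy}+G_{zz})_z - (1-2\nu)(G_{xx}+G_{yy}) + (1-2\nu)\,G_{zz}
     = 2(1-2\nu)\,G_{zz}, \\
\sigma_z &= 2\mu\left[-zG_{zzz} + (1-2\nu)\,G_{zz} + 2\nu\,G_{zz}\right]
          = -2\mu\left[zG_{zzz} - G_{zz}\right], \\
\tau_{zx} &= \mu\,(u_z + w_x)
           = \mu\left[-2zG_{xzz} + \bigl(-1-(1-2\nu)+2(1-\nu)\bigr)G_{xz}\right]
           = -2\mu\, zG_{xzz}.
\end{aligned}
```

On the plane z = 0 this gives $\sigma_z = 2\mu G_{zz} = -2\mu\,(G_{xx} + G_{yy})$ and $\tau_{zx} = 0$, which is the form in which the stresses are used on the crack faces.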


For the sake of a first orientation let us consider the configuration of Figure 2b.
It admits in particular states of plane strain, and the asymptotic relations of the
preceding section apply if the roles of y and z are exchanged. In order to exhibit
a more general class of states we set up

$$G(x,y,z) = F(x,z) \cos\lambda y \qquad (2.4)$$

with some real constant $\lambda \geq 0$. The case $\lambda = 0$ is that of plane strain. The
function F(x,z) must satisfy

$$F_{xx} + F_{zz} - \lambda^2 F = 0. \qquad (2.5)$$

Defining polar coordinates $\rho, \phi$ by means of

$$\rho e^{i\phi} = x + iz$$

we can rewrite (2.5) in the form

$$\rho^2 F_{\rho\rho} + \rho F_\rho + F_{\phi\phi} - \lambda^2 \rho^2 F = 0. \qquad (2.6)$$

It admits the product solution

$$F = F^*(\lambda\rho) \cos\tfrac{3}{2}\phi \quad \text{with} \quad F^*(t) = 3\,(\pi/2)^{1/2}\, I_{3/2}(t). \qquad (2.7)$$

$I_{3/2}$ is the modified Bessel function of type I and of fractional order 3/2.
Altogether we can write

$$G(x,y,z) = g(x,z)\, h(\lambda\rho) \cos\lambda y \quad \text{with}$$

$$g(x,z) = (\lambda\rho)^{3/2} \cos\tfrac{3}{2}\phi = \mathrm{Re}\,[\lambda(x+iz)]^{3/2}, \qquad h(t) = \frac{3}{t} \frac{d}{dt} \frac{\sinh t}{t}. \qquad (2.8)$$

h(t) admits an expansion in even powers of t; moreover h(0) = 1. The function
g(x,z) itself is a Boussinesq-Papkovich potential. It describes a state of
plane strain and of mode I. The field of displacements and stresses is regular.
The y-axis represents the edge of the crack. As we let $\rho \to 0$ we approach the
edge. Asymptotic relations of type (1.1) and (1.2) after exchange of the roles
of y,z become valid with an intensity factor k depending on y. This is due to
the preponderance of g(x,z) in the representation (2.8) of G(x,y,z). Locally
the behavior of the field near an edge point y is given by the field of plane
strain of g(x,z) but modified by the factor $\cos\lambda y$. We list in particular:


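$$k = k(y) = k(0) \cos\lambda y \qquad (2.9)$$

$$u = \frac{k}{4\mu} (2\rho)^{1/2} (\kappa - \cos\phi) \cos\tfrac{1}{2}\phi, \qquad w = \frac{k}{4\mu} (2\rho)^{1/2} (\kappa - \cos\phi) \sin\tfrac{1}{2}\phi, \qquad v = O(\rho^{3/2}) \qquad (2.10)$$

$$\sigma = k\,(2\rho)^{-1/2} f(\phi) \cos\tfrac{1}{2}\phi \quad \text{where}$$

$$f(\phi) = \begin{cases} 1 - \sin\tfrac{1}{2}\phi \,\sin\tfrac{3}{2}\phi & \text{for } \sigma = \sigma_x \\ 1 + \sin\tfrac{1}{2}\phi \,\sin\tfrac{3}{2}\phi & \text{for } \sigma = \sigma_z \\ 2\nu & \text{for } \sigma = \sigma_y \\ \sin\tfrac{1}{2}\phi \,\cos\tfrac{3}{2}\phi & \text{for } \sigma = \tau_{zx} \end{cases} \qquad (2.11)$$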
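The function h(t) of (2.8) can be examined numerically. The sketch below assumes the reading $h(t) = (3/t)\, d/dt\,(\sinh t / t) = 3(t\cosh t - \sinh t)/t^3$ and checks three things: h(0) = 1, the radial factor $t^{3/2} h(t)$ satisfies the modified Bessel equation of order 3/2 that follows from (2.6) for a $\cos\tfrac{3}{2}\phi$ dependence, and that factor agrees with $3(\pi/2)^{1/2} I_{3/2}(t)$. The names, the sample argument and the small-t series switch are illustrative choices.

```python
import math

def h(t):
    """h(t) = (3/t) d/dt (sinh t / t); even power series used near t = 0."""
    if abs(t) < 1e-2:
        return 1.0 + t * t / 10.0 + t ** 4 / 280.0
    return 3.0 * (t * math.cosh(t) - math.sinh(t)) / t ** 3

def radial_factor(t):
    """t^(3/2) h(t), the radial part multiplying cos(3 phi / 2)."""
    return t ** 1.5 * h(t)

def bessel_i_three_halves(t):
    """Closed form of the modified Bessel function I_{3/2}(t)."""
    return math.sqrt(2.0 / (math.pi * t)) * (math.cosh(t) - math.sinh(t) / t)

def bessel_residual(t, step=1e-3):
    """Finite-difference residual of t^2 F'' + t F' - (t^2 + 9/4) F = 0."""
    f0 = radial_factor(t)
    fp = radial_factor(t + step)
    fm = radial_factor(t - step)
    d1 = (fp - fm) / (2.0 * step)
    d2 = (fp - 2.0 * f0 + fm) / (step * step)
    return t * t * d2 + t * d1 - (t * t + 2.25) * f0

t0 = 1.3
h_at_zero = h(0.0)
residual = bessel_residual(t0)
closed_form_gap = abs(radial_factor(t0)
                      - 3.0 * math.sqrt(math.pi / 2.0) * bessel_i_three_halves(t0))
print(h_at_zero, residual, closed_form_gap)
```

The residual is limited by the finite-difference step and should be far below the size of the individual terms.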


Furthermore the stresses $\tau_{xy}$, $\tau_{yz}$ stay bounded. The special potential (2.8)
induces no tractions on the faces of the crack. This is obvious inasmuch as

=  0  follows.  The  displacements  are  ^ 
bo^fd  confined  to  domains 


Still with regard to Figure 2b let us consider

$$G(x,y,z) = \mathrm{Erfc}(q)\, e^x \cos y; \qquad q = (2\rho)^{1/2} \cos\tfrac{1}{2}\phi. \qquad (2.12)$$

The function G is harmonic. Writing for simplicity Erfc(q) = Q(q) and observing
that q as well as the product $e^x \cos y$ are harmonic functions we find

$$\nabla^2 G = e^x \cos y\, (\nabla^2 Q + 2Q_x), \qquad (2.13)$$

$$\nabla^2 Q = -2qQ'(q)(q_x^2 + q_z^2) = -qQ'(q)/\rho, \qquad (2.14)$$

$$2Q_x = 2Q'(q)\, q_x = qQ'(q)/\rho, \qquad (2.15)$$

and altogether $\nabla^2 G = 0$ as asserted. The potential (2.12) is periodic in y with
the period $2\pi$. In spite of the factor $e^x$ the potential as a whole and all of its
partial derivatives go to zero as $\rho \to \infty$. For small $\rho$ we may use

$$G = e^x \cos y \left( 1 - \frac{2}{\sqrt{\pi}}\, q + O(q^3) \right) \qquad (2.16)$$
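The harmonicity of the potential (2.12) can also be confirmed numerically. The following is a minimal sketch, assuming the reading $G = \mathrm{Erfc}(q)\, e^x \cos y$ with $q = (2\rho)^{1/2} \cos\tfrac{1}{2}\phi$ and $\rho e^{i\phi} = x + iz$, $-\pi < \phi \le \pi$; the sample points, the step size and the function names are arbitrary choices.

```python
import cmath
import math

def potential(x, y, z):
    """G = Erfc(q) e^x cos y with q = sqrt(2 rho) cos(phi/2), x + iz = rho e^{i phi}."""
    # The principal square root of x + iz has real part sqrt(rho) cos(phi/2) >= 0.
    q = math.sqrt(2.0) * cmath.sqrt(complex(x, z)).real
    return math.erfc(q) * math.exp(x) * math.cos(y)

def laplacian(f, x, y, z, step=1e-3):
    """Second-order central-difference approximation of the Laplacian of f."""
    centre = f(x, y, z)
    total = (f(x + step, y, z) + f(x - step, y, z)
             + f(x, y + step, z) + f(x, y - step, z)
             + f(x, y, z + step) + f(x, y, z - step) - 6.0 * centre)
    return total / (step * step)

# Sample points kept away from the crack plane z = 0, x < 0, where G_z jumps.
points = [(0.4, 0.3, 0.5), (-0.5, 1.0, 0.3), (-0.2, -0.7, -0.6)]
residuals = [laplacian(potential, *p) for p in points]

# On the crack faces (x < 0, z -> 0) q vanishes, so G reduces to e^x cos y.
face_gap = abs(potential(-0.8, 0.4, 0.0) - math.exp(-0.8) * math.cos(0.4))
print(residuals, face_gap)
```

The residuals are limited only by the truncation error of the difference scheme.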


in order to determine the asymptotic behavior of displacements and stresses as we
approach the edge of the crack. The function q can be taken as Boussinesq-Papkovich
potential; as such it leads to a state of plane strain. The state has
displacements


u 


w 


u 


w 


(2.17) 


A comparison with (1.4) shows that $u_s, w_s$ have the asymptotic properties of
the displacements of a fundamental field of plane strain and of mode I. Going
back to (2.16) we can expect the potential q to dominate the behavior of G in the
approach $\rho \to 0$. More precisely we find

$$u = a(y)\, u_s, \qquad w = a(y)\, w_s \quad \text{with} \quad a(y) = -\frac{2}{\sqrt{\pi}} \cos y \qquad (2.18)$$

as asymptotic representations of u,w in the case of G.

The field of G has vanishing shearing stresses $\tau_{zx}$, $\tau_{yz}$ in the (x,y)-plane.
We assert that $\sigma_z$ vanishes on the faces of the crack. As before we find

$$\sigma_z = -2\mu \Delta G \quad \text{with} \quad \Delta G = G_{xx} + G_{yy} \quad \text{on } C^+, C^-. \qquad (2.19)$$

But

$$G = e^x \cos y \quad \text{on } C^+, C^-$$

and $\Delta G = 0$ follows.

The displacements and stresses go to zero as $\rho \to \infty$.

The potential G(x,y,z) of (2.12) gives rise to other potentials
$G(\lambda x, \lambda(y - y'), \lambda z)$ where $\lambda$, y' are real constants and also $\lambda > 0$. These
potentials can be linearly combined in a finite number of terms, the combination
coefficients to be real. All combinations form a real linear space of infinite
dimension. Each potential of this space yields a field of displacements and
stresses which we now designate as fundamental field. This is a generalization of
fundamental fields of plane strain and justified by the asymptotic relations of
type (2.18) as well as by the absence of tractions on the crack faces.

We turn next to Figure 2a and disregard temporarily that the crack is to be
elliptic. More generally we admit as crack contour C any rectifiable Jordan
curve of continuous tangent. The Boussinesq-Papkovich potentials associated
with this crack configuration can be represented as harmonic potentials of single
layers, more precisely in the form


$$G(x,y,z) = -\frac{1}{4\pi(1-\nu)} \iint f(\xi,\eta)\, E^{-1/2}\, d\xi\, d\eta \quad \text{with} \quad E = (x-\xi)^2 + (y-\eta)^2 + z^2 \qquad (2.20)$$


The integration is over one of the crack faces. Of the density function $f(\xi,\eta)$
we assume continuity inside C and furthermore for interior points $(\xi,\eta)$


(2.21) 


where  is  some  constant  and  where  d  is  the  distance  from  the  contour  C  of 


(IjTj).  Fomulas  (2.3)  lead  to 


$$w = 2(1-\nu)G_z = f \ \text{on } C^+,$$
$$\phantom{w = 2(1-\nu)G_z} = -f \ \text{on } C^-,$$
$$\sigma_z = -g(x,y) \ \text{on } C^+, C^-$$


] 


with 


(2.22) 


{(2.23) 


d|  dT] 


As in (2.19), $\Delta$ stands for the Laplacian operator of the (x,y)-plane. The
function g(x,y) represents a pressure distribution within the crack. The
stresses $\tau_{zx}$, $\tau_{yz}$ vanish in the (x,y)-plane and in particular on the crack. In
the nontrivial case $f(\xi,\eta) \not\equiv 0$ we call G and the associated field fundamental if
no tractions appear on $C^+$, $C^-$, i.e. if $g(x,y) \equiv 0$. We call G and the
associated field regular if $f(\xi,\eta)$ satisfies a condition more stringent than
(2.21), namely


|f(|,Tj)  I  < 


(2.24) 
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where  is  a  suitable  constant.  Let  s  denote  the  arclength  on  C'  ^°^nted  fron 
some  -Doint  of  C  in  the  counterclockwise  sense  as  one 

Diane  In  the  neighborhood  of  any  point  s  of  C  we  expect  the  field  of  ^ 
Soothe  asymptotic  behavior  of  a  field  of  plane  strain  for  an 
S^e  crack-  the  latter  must  have  the  tangent  at  s  as  edge  and  must  follow  the 
l^r  no^i  S V  .t  in  the  case  of  a  regular  field  the  ap^totic 
behatior  will  be  detemined  by  a  stress  intensity  factor  It  =  )t(s).  In  thi 
context  we  list  in  particular  the  asymptotic  relations 

.  ;r  +  i  .1  V2  _+  (2.25) 


w  =  k(  s) 


a,  =  li(s)(2d) 

z 


.1/2  for  z  —  0  and  points  outside  the  crack 


2.26) 


As  for  the 


fundamental  field  we  merely  write  the  analogue  of  (2.25)  in  the  form 


w  =  P(s)d' 


(2.27) 


where the intensity function $\beta(s)$ depends on the fundamental field.

In the case of plane strain Betti's theorem of reciprocity was applied to
the mixed energy formed by a regular and by a fundamental field. The procedure
led to formulas of type (1.8). The same method can be used for the
configuration of Figure 2a [2]. This yields the analogue of (1.8') in the form


k(s)p(s)ds 


M(x,y)g(x,y)dxdy 


(2.28) 


V/ 

M(x,y) is the normal displacement w of the fundamental field on $C^+$. The factor
g(x,y) is the pressure within the crack of the regular field. It is obvious that
one fundamental field does not permit to determine the function k(s). We need
infinitely many or - for practical purposes - a sufficiently large number of
linearly independent fundamental fields. In order to find such fields we must
solve the homogeneous integro-differential equation

$$0 = \Delta \iint f(\xi,\eta)\, [\,(x-\xi)^2 + (y-\eta)^2\,]^{-1/2}\, d\xi\, d\eta \qquad (2.29)$$

for nontrivial density functions $f(\xi,\eta)$.

Figure 3

If we possess a fundamental field for the crack configuration
of Figure 2a we can construct a fundamental field for a finite elastic body with
the same crack. Figure 3 shows a sphere S around the origin. The elastic body
is bounded by S and by the faces $C^+$, $C^-$. In analogy to the construction (1.4)
we add to our fundamental field for Figure 2a a regular field for Figure 3
such that the regular field has no tractions on the crack faces while its
tractions on S annihilate those of the fundamental field of Figure 2a. The
resulting field is free of tractions on crack faces and on S; it displays the
asymptotic behavior of the initial fundamental field near the edge C of the
crack. Consider now a fundamental and a regular field for Figure 3, both of mode I.
Let the regular field be generated by a distribution of tractions on S. Under
these circumstances the analogue of (1.8) is


_/k(s)3(s)ds  = -~  JJ(^fX^  +  v^Y^  +  w^^)dS 

C  ^  To 


(2.30) 


where $u_f, v_f, w_f$ are the displacements of the fundamental field and $X_r, Y_r, Z_r$ the
components of traction of the regular one. The fundamental potential G in (2.12)
can be used in order to construct an analogue of formula (2.28). Since G has
period $2\pi$ one should apply the associated fundamental field to the analysis of
k(y) of a regular field with the same period and the same symmetry with respect
to y. Moreover it will suffice to consider a slab $0 < y < \pi$. Further details
can be left to the reader.

3. PENNY-SHAPED AND ELLIPTIC CRACK. We return to Figure 2a and interpret
C as an ellipse with half-axes a,b. The ellipse has the equation

$$E(x,y) = 1 - x^2/a^2 - y^2/b^2 = 0. \qquad (3.1)$$

The $\omega$-zeros of the function

$$T(\omega; x,y,z) = 1 - \frac{x^2}{a^2 + \omega} - \frac{y^2}{b^2 + \omega} - \frac{z^2}{\omega}$$

define elliptic coordinates. The largest $\omega$-root of T = 0 will play an
important role.


At this juncture we turn to the penny-shaped crack by letting b = a. Without
essential loss of generality we assume a = 1. Cylindrical coordinates $r, \theta, z$
will be useful. We have here $x = r\cos\theta$ and $y = r\sin\theta$. The function T takes
the special form

$$T = 1 - \frac{r^2}{1 + \omega} - \frac{z^2}{\omega}. \qquad (3.2)$$

The mapping (see also [7])

$$r + iz = \cosh(s + it); \qquad s \geq 0, \quad -\tfrac{1}{2}\pi \leq t \leq \tfrac{1}{2}\pi \qquad (3.3)$$

permits to represent pairs (r,z) by pairs (s,t) in accord with Figure 4.

Figure 4

The representation is unique whenever $z \neq 0$ or r > 1. For points of the crack
two different representations appear which permit to distinguish between $C^+$ (t > 0)
and $C^-$ (t < 0). From (3.3) it follows that (3.2) has the roots

$$\omega_1 = \sinh^2 s, \qquad \omega_2 = -\sin^2 t. \qquad (3.4)$$

The following relations are useful:

$$r = \cosh s \cos t, \qquad z = \sinh s \sin t; \qquad (3.5)$$

$$s_r = t_z = \sinh s \cos t / N, \qquad t_r = -s_z = -\cosh s \sin t / N; \qquad (3.6)$$

$$N = \sinh^2 s + \sin^2 t, \qquad \nabla^2 F(s) = (F''(s) + \tanh s\, F'(s))/N; \qquad (3.7)$$

$$s_z = 1/\sin t = (1 - r^2)^{-1/2} \ \text{on } C^+, \qquad s_z = -(1 - r^2)^{-1/2} \ \text{on } C^-. \qquad (3.8)$$

Furthermore

$$\sinh^2 s \leq r^2 + z^2 \leq \cosh^2 s = 1 + \sinh^2 s. \qquad (3.9)$$
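The coordinate relations above are easily checked with complex arithmetic. The following is a minimal sketch, assuming the mapping $r + iz = \cosh(s + it)$ and the special form $T = 1 - r^2/(1+\omega) - z^2/\omega$; the sample values of s and t and the helper names are arbitrary.

```python
import cmath
import math

def to_rz(s, t):
    """Map (s, t) to (r, z) through r + i z = cosh(s + i t)."""
    zeta = cmath.cosh(complex(s, t))
    return zeta.real, zeta.imag

def T(omega, r, z):
    """Special form of T for the penny-shaped crack (a = b = 1)."""
    return 1.0 - r * r / (1.0 + omega) - z * z / omega

def s_of(r, z):
    """Coordinate s recovered from (r, z) by the principal inverse cosh."""
    return cmath.acosh(complex(r, z)).real

s, t = 0.7, 0.4
r, z = to_rz(s, t)

# Roots omega_1 = sinh^2 s and omega_2 = -sin^2 t of T = 0.
omega1 = math.sinh(s) ** 2
omega2 = -math.sin(t) ** 2
root_err = max(abs(T(omega1, r, z)), abs(T(omega2, r, z)))

# Round trip through the principal inverse hyperbolic cosine.
round_trip_err = abs(cmath.acosh(complex(r, z)) - complex(s, t))

# Partial derivative s_r = sinh s cos t / N, compared with a central difference.
N = math.sinh(s) ** 2 + math.sin(t) ** 2
step = 1e-6
s_r_fd = (s_of(r + step, z) - s_of(r - step, z)) / (2.0 * step)
s_r_formula = math.sinh(s) * math.cos(t) / N

# Bounds sinh^2 s <= r^2 + z^2 <= cosh^2 s.
R_sq = r * r + z * z
bounds_hold = math.sinh(s) ** 2 <= R_sq <= math.cosh(s) ** 2
print(root_err, round_trip_err, s_r_fd - s_r_formula, bounds_hold)
```

For points with r > 0 the principal value of the inverse hyperbolic cosine returns $s \geq 0$ and a value of t carrying the sign of z, which matches the convention of Figure 4.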


In what follows we establish an infinite family of fundamental potentials
without solving (2.29) directly. We set

$$G_n(x,y,z) = F_n(r,z) \cos n(\theta - \theta') \quad \text{and} \quad F_n(r,z) = r^n H_n(s) \quad \text{for } n = 0,1,2,\ldots \qquad (3.10)$$

$\theta'$ is a constant which may depend on n. We shall try to make $G_n$ a fundamental
potential through a suitable choice of $H_n(s)$. Writing altogether

$$G_n = r^n \cos n(\theta - \theta') \cdot H_n(s)$$

we observe that the factor preceding $H_n$ is a harmonic function, i.e.

$$\nabla^2 (r^n \cos n(\theta - \theta')) = \Delta\, r^n \cos n(\theta - \theta') = 0. \qquad (3.11)$$

This in turn together with (3.6), (3.7) yields after steps of an elementary
nature

$$\nabla^2 G_n = r^n \cos n(\theta - \theta')\, [\, H_n''(s) + (2n+1) \tanh s \cdot H_n'(s) \,] / N. \qquad (3.12)$$

Consequently we must solve

$$H_n''(s) + (2n+1) \tanh s \cdot H_n'(s) = 0. \qquad (3.13)$$

We find

$$H_n'(s) = \frac{c}{\cosh^{2n+1} s} \qquad (3.14)$$

with some constant c. Integration of $H_n'(s)$ and a special choice of c yield

1 


With 


H  (s)  =  a 
n  n 


a  = 
o 


1 

-  arctan  ( sinh  s)  -  a  sinh  s  J!  _ _ 

^  O  ^  PV 

k=l  2kD!j^cosh  s  j 

- -  ^  =  a  (-1)^ 

l.v)Vi  ^  (  k  )  • 


(3.15)

We leave the verification of (3.15) to the reader. Note that the definition of
the coefficients is independent of n!

Having established the potential $G_n$ we check on some of its properties.
Due to (2.2)

$$w = 2(1-\nu)\, G_{nz} \quad \text{on } C^+. \qquad (3.16)$$

Now (3.14) and (3.8) yield

$$G_{nz} = r^n \cos n(\theta - \theta')\, H_n'(0)\, s_z = c\, r^n \cos n(\theta - \theta') \cdot (1 - r^2)^{-1/2} \quad \text{on } C^+. \qquad (3.17)$$

In the construction of $H_n$ we have chosen the constant c of (3.14) such that

$$\sqrt{2}\,(1-\nu)\, c = 1. \qquad (3.17')$$

This choice implies

$$\beta = \cos n(\theta - \theta')$$


for  the  intensity  P(s)  associated  with  We  stiU 

not  induce  tractions  on  the  crack.  The  nature  of  G  makes  it  obvious  that 
T,,,  vanish  on  the  (x,y)-plane.  As  for  we  olaserve  that 


G_  =  r'^cos  n  ( 6  -  6' )Nj^(*^)  on  C  ,  C 


(3.18) 


n 


Due  to  (2  19)  and  (3.11)  vanishes.  Finally  it  can  be  established  that  the  , ^2 
Sel^of  G  has  vanishing  displacements  and  stresses  at  R  =  ~  where  R=  (r  +  z2)  /  , 
^oreov^r  ^GeltrSSs  ha^  th^  order  of  H-3  for  large  ' 

are  in  the  order  of  R-2.  In  this  context  we  refer  to  (3-9)  with  the  consequence 
R  ~  l.e^  and  to  (3.l4),  (3.15)  with  the  consequence 

H  (s)  =  0(e-(2-^l)^) 


(3.19) 


n 


for large R. All of this is compatible with the asymptotic behavior for large R
of the field of the potential of a single layer. $G_n$ admits a representation
(2.20) with the density function


f  =  f  =  V^r'^cos  n( 0  -  G ' )  •  (l-r^) 


(3.20) 


n 
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Formula  (2,28)  takes  the  special  form 

2it  1 


2it  1 

J  i'(e)<=osn(e-e’)ae.|j  jAi.r^r^Ko.n(0.e')s(r,0)rine 

k(e)  ?  permit  to  calculate  the  Fourier  components  of  k(©)  and  thus 

k(e).  One  can  also  establish  the  following  formula  (see  Figure  5)  ^ 


21) 


k(e')  =  ~JjM(T,e,e')g(r,e)Tdrde  where 


M  =  (1-  r‘ 


d2  = 


1+r  -2rcos(e-e') 


(3.22) 


In  this  case  the  intensity  ^(s)  is  a  Dirac  delta  function  on  C.  An  extension 
f  the  concept  of  weight  functions  in  the  nature  of  the  case  pp)  Vint:  -koo 
suggested  by  Rice  [3]  in  general  fonn.  Fomula  (3.22)  appeLs  ;fL 

Formas  for  the  periy-Saped  crack  o5  t^r' 
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\T*-r  "n-l 

-  (2)  -  0;  [D(D  +  1)-Il(n  +  1)]F^  +  F^^^j  -  0 


(3.23) 


D  =  r 


The operator D preserves harmonicity. The third relation expresses the harmonicity
of $G_n$ without reference to the $\theta$-term. It is possible to establish the
fomulas  of  the  first  line  by  merely  using  ^  0  and  the  asymptotic  behavior  of 


w  on  the  crack  in  the  case  of  any  G  . 


Returning to the general elliptic crack we make extensive use of available
literature [? - 11]. In particular Dyson's formulas [8,9] will be applied.
Following Dyson we write the density function f in (2.20) in the form

$$ f(x,y) = -4\pi(1-\nu)\, h(x,y)\, E^{\lambda - 1/2}(x,y) $$


(3.24) 


where E is the function in (3.1). We are primarily interested in the cases
$\lambda = 0$ and $\lambda = 1$; h(x,y) is to be a polynomial in x and y. Under these
circumstances the case $\lambda = 1$ will yield a regular potential and the case
$\lambda = 0$ a fundamental one for properly chosen h(x,y). Dyson himself admits more
general h(x,y). The case $\lambda = 0$ is pertinent to the analysis of an electrically
charged disk; so far it has not been applied to elastic analysis. We introduce the
following denotations and symbols:

$$ Q(s) = s(a^2+s)(b^2+s); \qquad q(s) = Q^{1/2}(s) > 0 \quad \text{for } s > 0 $$


p  =  -27- 

a  +  s 


b  +  s 


D-  + 

p8x2  'la/ 


(3.25) 


(3.26) 


We denote the largest root of T = 0 by t; it is nonnegative. These symbols
and denotations are unrelated to formerly defined quantities. Dyson and
Hobson have shown that the potential G of (2.20) can be rewritten as a single
integral,

jtabr(\+-5)  r 

G(x,y,z)  =  -3—7 J  fij  \l>(px,qy)  •  ds  ,  (a.  «  s)  (3.27) 
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where  is  the  following  differential  operator: 

sW 


00 


n=o  4  n:(\+l)(x+2)...(x+n) 


(3.27') 


The symbol $\Gamma$ denotes the Gamma function. We replace $\lambda\Gamma(\lambda)$ by unity for $\lambda = 0$.
Since h(x,y) is a polynomial only a finite number of terms in (3.27) have to be
used. $M_\lambda h(px,qy)$ is therefore a polynomial in x, y whose coefficients are
functions of s. Let us now consider G on the crack; in this case t = 0 and thus


,  _  «abr(x  +  |)  f 
"  r(i)xr(x)  i  ^  \h(px,qy)ds  valid  for  C  ,  C" 


r(^)^r(x) 


(3.28) 


he  cases  X  -  0,  X  -  1  the  function  T  is  a  polynomial  in  x,y.  Altogether 
we  see  now  that  G(x,y,0)  is  a  polynomial  in  x,y  on  the  crack,  and  so  is 
-  cr  =  A24G(x,y,0)  =  g(x,y).  We  can  write 


$$ g = \mathcal{L}_\lambda h $$

where $\mathcal{L}_\lambda$ denotes a linear operator which transforms the polynomial h into a
polynomial g. The nature of the mapping depends on $\lambda$.


(3.29) 


Case $\lambda = 1$

This is the case of ordinary elasticity. $\mathcal{L}_1$ maps the real linear space of
all polynomials of degree $\le m$ (real coefficients) into itself. $\mathcal{L}_1 h = 0$ for some
$h \ne 0$ cannot happen. The mapping is therefore 1-1; given g there is a unique h.
The mapping does not necessarily transform homogeneous polynomials into
homogeneous ones.


Case $\lambda = 0$

If h has degree m then $g = \mathcal{L}_0 h$ has degree not higher than $m-2$. The case
$\mathcal{L}_0 h = 0$ for $h \ne 0$ can happen. We call such an h a fundamental polynomial. It
leads to a fundamental field G. Trivial cases are: h = 1, h = x, h = y, h = xy.


Here the reader is reminded of the definition of the degree of a polynomial $h(x,y)$.
If h is a monomial, i.e. $h = c\,x^m y^n$ with $c \ne 0$, then the degree of h is $m + n$. If
h is a combination of monomials we look for the monomial of highest degree; that
degree is the degree of h. A polynomial is homogeneous if all of its monomials
have the same degree.
nomial  '^lues  of  h  on  E  =  0  are  given  by  the  Fourier  poly- 

E(aoo3  bsL  e)  aS;  ’’  '"'era.  <  H  If  h  has  aegree  N.  Bote  that 


For each degree $m \ge 2$ two fundamental polynomials h(x,y) of degree m can be
constructed as follows: Set
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(3.30) 


$$ h(x,y) = x^m + h_1(x,y)\,E(x,y) \quad \text{where} \quad \mathcal{L}_1 h_1 = -\mathcal{L}_0 x^m $$

Since $\mathcal{L}_0 x^m$ has degree $\le m-2$ the polynomial $h_1$ is of degree $\le m-2$; consequently
$h_1 E$ has degree $\le m$, and the degree of h cannot exceed m. But $h = x^m = a^m \cos^m \varphi$
on E = 0; this is a Fourier polynomial of degree m. The degree of h(x,y) cannot
be less. Thus h is seen to have exact degree m. Now $h_1 E$ and $\lambda = 0$, $h_1$ and $\lambda = 1$
define the same potential. Hence

$$ \mathcal{L}_0(h_1 E) = \mathcal{L}_1 h_1 \quad \text{and} \quad \mathcal{L}_0 h = 0 . \tag{3.31} $$

This establishes h as a fundamental polynomial of degree m. In the same vein
we construct

$$ h(x,y) = x^{m-1} y + h_2(x,y)\,E(x,y) \quad \text{where} \quad \mathcal{L}_1 h_2 = -\mathcal{L}_0\, x^{m-1} y . \tag{3.32} $$


With (3.30), (3.32) we have obtained two linearly independent fundamental
polynomials of degree m.
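The notions of degree and homogeneity used in this construction are easy to state in code. The following is a minimal sketch, assuming a polynomial in x and y is stored as a dictionary from exponent pairs to coefficients; the representation and the sample coefficients are illustrative and do not come from the paper.

```python
def degree(poly):
    """Degree of a polynomial in x, y stored as {(i, j): coefficient} for x**i * y**j."""
    return max(i + j for (i, j), c in poly.items() if c != 0)

def is_homogeneous(poly):
    """True when every monomial with a nonzero coefficient has the same degree."""
    return len({i + j for (i, j), c in poly.items() if c != 0}) == 1

h_xy = {(1, 1): 1.0}                    # h = x*y, one of the trivial cases
h_mixed = {(4, 0): 1.0, (2, 0): -0.5}   # x**4 - 0.5*x**2, made-up coefficients

print(degree(h_xy), is_homogeneous(h_xy))
print(degree(h_mixed), is_homogeneous(h_mixed))
```

The second sample has degree 4 but mixes monomials of degree 4 and 2, so it is not homogeneous.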


The application of the operators $\mathcal{L}_0$, $\mathcal{L}_1$ involves certain elliptic integrals.
The following coefficients are needed for the construction of fundamental
polynomials:


(^)  = 
mn 


00 


/ 


O 


^-1/2 +  ipm+l/2^n+ 1/2^3 . 


(3.33) 


m,n,£  run  through  the  nonnegative  integers.  The  coefficients  satisfy  the  recur 
sions 


2  2  2 

C  =  TC^  ,  +  (1-T  )c  With  T  =  a  /(a  -b  ) 

mn  m-l^n  m,n-x 


(3.34) 


Up  to  degree  3  fundamental  polynomials  can  be  homogeneous.  This  is  no  longer  so 
from  degree  four  on.  We  give  some  polynomials  below: 

Fundamental  polynomials  up  to  degree  4 


m  =  0:  h  =  1 

m  =  l:  h  =  x,  h  =  y 

m  =  2:  h  "  -  Cj^/,  h  =  xy 
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m-3:  h  =  c  -  3c„  xy2,  h= 


-  3c  -X  y 


m  =  4:  h  =  -  Cp,xy^,  h  =  ax^  +  3x^y^  +  yy^ 


4  2  ? 

yy  +  Bx"^  +  ey^ 


with  the  following  coefficients- 


a  = 

'll  2%2' 

c 

,  P  » -3  2o  02 

'ai-'is  -5 V"  20^2 

5'^2o-2<=21  -^“oa  ■- 

^^2o"^‘'21  °21”''i2 


2(  T  -  l)  |'(a  -  3)a^  +  y\)^  ^  j 


€  =  -  -it  jaa^  -  0-;y)b^  ]  . 


crLfthrintLsity  f^Stiorp^is^  ^^Inction  M  = 


(3.35) 
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THE  BUCKLING  PRESSURE  OF  AN  ELASTIC  PLATE  FLOATING 
ON  WATER  AND  STRESSED  UNIFORMLY  ALONG  THE  PERIPHERY 
OF  AN  INTERNAL  HOLE 


Shunsuke Takagi
Corps of Engineers

U.S. Army Cold Regions Research and Engineering Laboratory

Hanover,  New  Hampshire 


INTRODUCTION 

To test the strength of an ice sheet floating on water the
following measurement is regularly performed (Zabilansky et al., 1):
Dig a hole, place a vertical pile of various shapes in it, and push it,
breaking through the ice. However, the mechanism of the failure is not yet
clarified, and the interpretation of the data is not yet satisfactory.
To understand the basic mechanism, an ideally simple case is chosen and
analyzed in this paper.

A  paper  of  the  same  title  was  presented  at  the  20th  Conference  of 
Army  Mathematicians  (1974).  When  numerical  work  was  attempted  in  the 
summer  of  1975,  it  was  found  that  the  analysis  presented  in  the  20th 
Conference  did  not  work  as  expected.  A  new  analysis  as  reported  in  this 
paper was developed, and the numerical computation was carried out.

1. The Problem

Suppose a thin elastic plate floating on water, extending horizontally
to infinity, and stressed with uniform horizontal pressure along
the periphery of an internal circular hole. We are interested in formulating
the buckling pressure and the deformation at the failure.

The vertical deflection $w$ of an elastic plate that rests on a liquid
and is subjected to a vertical load $q$ and the horizontal stress of
components $N_{xx}$, $N_{xy}$ and $N_{yy}$ is governed by the differential equation

$$ D \nabla^4 w + \gamma w = q + N_{xx} \frac{\partial^2 w}{\partial x^2} + 2 N_{xy} \frac{\partial^2 w}{\partial x\, \partial y} + N_{yy} \frac{\partial^2 w}{\partial y^2} \tag{1.1} $$

where $D$ is the flexural rigidity and $\gamma$ the specific weight of the
liquid (Ref. 2). Let $r$ be the radial distance from the center of the
hole. In our problem $q = 0$ and the deformation is cylindrically symmetric
around the center of the hole. Then (1.1) becomes

$$ \ell^4 \nabla^4 w + w = \frac{1}{\gamma} \left( N_{rr} \frac{d^2 w}{dr^2} + N_{\theta\theta} \frac{1}{r} \frac{dw}{dr} \right) \tag{1.2} $$

where $\ell = (D/\gamma)^{1/4}$ is the characteristic length, and $N_{rr}$ and $N_{\theta\theta}$ are the
radial and hoop horizontal stresses in the plate (see Appendix B).

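Following the usual treatment (Ref. 2), we assume that the horizontal
stress components are in equilibrium by themselves. Then
they are derived from a biharmonic function $\Phi$ by

$$ N_{xx} = \frac{\partial^2 \Phi}{\partial y^2}, \qquad N_{yy} = \frac{\partial^2 \Phi}{\partial x^2}, \qquad N_{xy} = -\frac{\partial^2 \Phi}{\partial x\, \partial y} $$

In the general polar coordinates they are

$$ N_{rr} = \frac{1}{r} \frac{\partial \Phi}{\partial r} + \frac{1}{r^2} \frac{\partial^2 \Phi}{\partial \theta^2}, \qquad N_{\theta\theta} = \frac{\partial^2 \Phi}{\partial r^2}, \qquad N_{r\theta} = -\frac{\partial}{\partial r} \left( \frac{1}{r} \frac{\partial \Phi}{\partial \theta} \right) $$

In our problem $\Phi$ is a function of $r$ only, and the stress components must tend
to zero when $r$ becomes infinite. Then they are formulated as

$$ N_{rr} = -A r^{-2}, \qquad N_{r\theta} = 0, \qquad N_{\theta\theta} = A r^{-2} $$

where $A$ is a constant. Constant $A$ is positive because $N_{rr}$ is pressure.
Instead of $A$ we introduce nondimensional constant $\alpha$ and express the
stress components as

$$ N_{rr} = -\alpha \gamma \ell^4 r^{-2}, \qquad N_{\theta\theta} = \alpha \gamma \ell^4 r^{-2} \tag{1.3} $$

Introduce the nondimensional length $x$,

$$ x = r/\ell $$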


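(1.4)

In this way (1.2) becomes

$$ \frac{d^4 w}{dx^4} + \frac{2}{x} \frac{d^3 w}{dx^3} - \frac{1-\alpha}{x^2} \frac{d^2 w}{dx^2} + \frac{1-\alpha}{x^3} \frac{dw}{dx} + w = 0 \tag{1.5} $$

At $x = \infty$, the condition

$$ w = 0, \qquad \frac{dw}{dx} = 0 \tag{1.6} $$

must be satisfied. At $x = x_0$, where $x_0$ is the value of $x$ at the
periphery of the internal hole, we consider three conditions: (1) the
clamped-edge condition

$$ w = 0, \qquad \frac{dw}{dx} = 0 \tag{1.7} $$

(2) the simple-edge condition

$$ w = 0, \qquad \frac{d^2 w}{dx^2} + \frac{\nu}{x} \frac{dw}{dx} = 0 \tag{1.8} $$

and (3) the free-edge condition

$$ \frac{d^2 w}{dx^2} + \frac{\nu}{x} \frac{dw}{dx} = 0, \qquad \frac{d}{dx} \left( \frac{d^2 w}{dx^2} + \frac{1}{x} \frac{dw}{dx} \right) + \frac{\alpha}{x^2} \frac{dw}{dx} = 0 \tag{1.9} $$

where $\nu$ is Poisson's ratio.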

The second equation of (1.8) and the first equation of (1.9) are
found from $M_{rr} = 0$. The second equation of (1.9) is derived from
$Q_r + (1/r)(\partial M_{r\theta}/\partial\theta) = 0$. The effect of horizontal stress must be
counted in $Q_r$. In the rectangular coordinates x, y, shears $Q_x$ and $Q_y$
are given by


W 


Q. 


Q. 


XX 


■dM 


xy_ 


dX 


9y 


-  ,, 

+  N  "5 
XX  3x 


m  3M 

+  —UK 


*  "xv  i  ♦  %  Bt, 


ff  — 

xy  3y 


3z^ 


(1.10) 


''y  dx 

These equations are found by extending Hetényi's (3) one-dimensional
treatment to two dimensions. In polar coordinates $r$, $\theta$, components of
shear $Q_r$ and $Q_\theta$ are given (see Appendix A) by

P  D 


Q. 


Q. 


-  M 
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,7  ^  1  9£  717 

3r  ^rr  r  36  ^r6 


+  2  1  +  ^  —  N 

3r  r  ^rQ  ^  r  36  32’  r6  2’  30  66 


I  (1.11) 


Constant $\alpha$ is the eigenvalue to be determined to satisfy the boundary
conditions at $x = x_0$. The first step for the solution of this eigenvalue
problem is to discover, given a positive number $\alpha$, two real
functions, $w_1(x)$ and $w_2(x)$, that are the solutions of the differential
equation (1.5) and meet the boundary conditions at $x = \infty$ in (1.6) but
are not restricted at $x = x_0$ in any way. We call them the fundamental
solutions. We shall find them later in the following form.


CO 

r 

/2 

/  2  k  \ 

/2 

/  2  ^  ^ 

=  1 

•'l 

[r  +  2’  -Ij 

+  \p  +  p  -y 

-l+^ 

- X2’  -1_ 

V2  1+  2 

e  r  -1  dr* 


(1.12)

The second step is to express the fundamental solutions as power
series of x. Let $f_m(x)$ ($m = 0, 1, 2, 3$) be the Fuchsian type solutions
of (1.5) relative to $x = 0$. We shall find linear combinations
3 


Wj^ix)  = 


=  X.  '■to' 


(1.13) 


by determining the constants by use of (1.12). The solution $w(x)$ is
a linear combination of the fundamental solutions,

$$ w(x) = A\, w_1(x) + B\, w_2(x) \tag{1.14} $$
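The exponents of the Fuchsian type solutions at $x = 0$ can be checked with a few lines of code. The sketch below assumes that (1.5) reads $w'''' + (2/x)w''' - ((1-\alpha)/x^2)w'' + ((1-\alpha)/x^3)w' + w = 0$ and that $\mu = \sqrt{1-\alpha}$; the definition (2.1) of $\mu$ is not reproduced in this part of the text, so that identification is an assumption. Substituting $w = x^\rho$ into the singular terms gives an indicial polynomial whose roots should be $0$, $1-\mu$, $1+\mu$ and $2$.

```python
import math

def indicial_polynomial(rho, alpha):
    """Coefficient of x**(rho - 4) when w = x**rho is inserted into
    w'''' + (2/x) w''' - ((1 - alpha)/x**2) w'' + ((1 - alpha)/x**3) w'."""
    return (rho * (rho - 1) * (rho - 2) * (rho - 3)
            + 2 * rho * (rho - 1) * (rho - 2)
            - (1 - alpha) * rho * (rho - 1)
            + (1 - alpha) * rho)

def exponents(alpha):
    """Candidate exponents 0, 1 - mu, 1 + mu, 2 with mu = sqrt(1 - alpha), 0 <= alpha <= 1."""
    mu = math.sqrt(1 - alpha)
    return [0.0, 1 - mu, 1 + mu, 2.0]

alpha = 0.36  # illustrative value, giving mu = 0.8
residuals = [indicial_polynomial(rho, alpha) for rho in exponents(alpha)]
print(exponents(alpha), residuals)
```

For $0 < \alpha < 1$ the four exponents are distinct and ordered $0 < 1-\mu < 1+\mu < 2$.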


The third step is to solve the simultaneous equations of $A$ and $B$
that are found by substituting (1.14) into the boundary conditions
(1.7), (1.8), or (1.9) at $x = x_0$. If a root of the algebraic equation
found by setting the determinant of the simultaneous equations equal to
zero is positive, the root gives $x_0$. Our problem is then solved.

1a. Abstract of the result.

The main feature of the numerical result is as follows:

1. Buckling takes place under the free-edge condition. Buckling
does not take place under the clamped-edge and the simple-edge conditions.

2. Eigenvalue $\alpha$ under the free-edge condition is found in the
range $1 - \nu^2 < \alpha < \infty$, where $\nu$ is Poisson's ratio of the elastic ice plate.
When $\alpha = 1 - \nu^2$, root $x_0$ is equal to zero. Analysis presented here is
complete for the case $1 - \nu^2 \le \alpha \le 2$, but not complete for the case $2 < \alpha < \infty$.
It is believed that the result presented in this paper can practically
cover all the cases of our interest.

3. Buckling under the free-edge condition takes the shape as shown
in Figures 5 and 6 ($\alpha$ is restricted to $1 - \nu^2 \le \alpha \le 2$). This shape of
deformation is observed frequently in laboratory experiments and field
tests. Therefore we may conclude that buckling is an important mechanism
of failure.
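The third step of the procedure in Section 1, finding positive roots of a 2x2 determinant, can be sketched as follows. The helper names are hypothetical, and the two decaying functions are stand-ins chosen only because their derivatives are known in closed form; they are not the fundamental solutions $w_1$, $w_2$ of the paper.

```python
import math

def positive_roots(det, lo, hi, samples=400, iterations=80):
    """Bracket sign changes of det on [lo, hi] and refine each by bisection."""
    roots = []
    width = (hi - lo) / samples
    for i in range(samples):
        a, b = lo + i * width, lo + (i + 1) * width
        fa, fb = det(a), det(b)
        if fa == 0.0:
            roots.append(a)
        elif fa * fb < 0.0:
            for _ in range(iterations):
                mid = 0.5 * (a + b)
                fm = det(mid)
                if fa * fm <= 0.0:
                    b = mid
                else:
                    a, fa = mid, fm
            roots.append(0.5 * (a + b))
    return roots

def clamped_determinant(w1, dw1, w2, dw2):
    """Determinant of the system from w = 0, dw/dx = 0 applied to A*w1 + B*w2."""
    return lambda x: w1(x) * dw2(x) - w2(x) * dw1(x)

# Stand-in decaying functions, NOT the paper's fundamental solutions.
w1 = lambda x: math.exp(-x) * math.cos(x)
dw1 = lambda x: -math.exp(-x) * (math.cos(x) + math.sin(x))
w2 = lambda x: math.exp(-x) * math.sin(x)
dw2 = lambda x: math.exp(-x) * (math.cos(x) - math.sin(x))

stand_in_roots = positive_roots(clamped_determinant(w1, dw1, w2, dw2), 0.1, 10.0)
cosine_roots = positive_roots(math.cos, 0.1, 3.0)  # plain check of the root finder
print(stand_in_roots, cosine_roots)
```

For the stand-ins the determinant equals $e^{-2x}$ and never changes sign, so the scan returns no root. With the true $w_1$, $w_2$ and the operators of (1.7), (1.8) or (1.9), the same scan would be applied to the corresponding determinant.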
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Their  individual  forms  are: 

fU)  ,  y  ^ 

“  r^^24^(3-y)J7|^3+u)j 


iin 


4U)  =  X 


(-1)”  r(|(?--))  rfe(5-^u)’ 


l+^''(2n+l)I  r(n+i(5-y))  r((n4^(5+y)j 


ltn+2 


(2.M 


(2.5) 


4+l(^) 


i(3±p)] 

1  r| 

)) 

^n+l+y 

r^n+l  (2+y) 

)  ■■ 

(n+i(3±y)j 

1  r| 

/  1,  j 

n+r(  5+y ) 

[  k  -  t 

(2.6) 


where $k = 1, 2$. In (2.6) we have introduced the convention that the upper
or lower of the double sign $\pm$ (or $\mp$) should be taken according to $k = 1$
or 2, respectively. This convention is observed throughout the paper.
The main objective of PART I is to determine the fundamental solution,
i.e. to determine the constants in (1.13).

Differential equation (1.5) has an irregular singularity at $x = \infty$.
In other words, the solution relative to $x = \infty$, say $f(x)$, can be found
in the form

$$ f(x) = e^{-\lambda x}\, x^{-1/2} \sum_{n=0}^{\infty} p_n x^{-n} $$

where $\lambda$ satisfies $\lambda^4 + 1 = 0$. The series $\sum_{n=0}^{\infty} p_n x^{-n}$ in this equation
is asymptotic and divergent in this case. Therefore this equation does
not provide any means for determining the constants in (1.13).

3. Contour Integral Solution

In order to find the fundamental solutions, (1.5) must be transformed
by means of the contour integral.
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y(c) 


(3.1) 


where $L$ is a contour in the complex plane of $\zeta$ that shall be determined
to let a solution of (1.5) satisfy the boundary condition (1.6) at $x = \infty$.
Following the usual procedure (Ince (4), pp. 187-188), one arrives at
the differential equation of $v(\zeta)$,

$$ (1+\zeta^4) \frac{d^3 v}{d\zeta^3} + 10 \zeta^3 \frac{d^2 v}{d\zeta^2} + (23+\alpha) \zeta^2 \frac{dv}{d\zeta} + (9+3\alpha) \zeta v = 0 \tag{3.2} $$

The contour $L$ selected for this solution is shown in Figure 1.
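A sketch of how (3.2) follows from (1.5) is given below. It assumes that (3.1) is the Laplace-type integral $w(x) = \int_L e^{x\zeta} v(\zeta)\, d\zeta$ (inferred from the exponential kernel of (3.11), since the displayed form of (3.1) is not legible here), that (1.5) has the form $w'''' + (2/x)w''' - ((1-\alpha)/x^2)w'' + ((1-\alpha)/x^3)w' + w = 0$, and that the integrated terms vanish at the ends of $L$. Under the integral, $x^k w^{(j)}$ then corresponds to $(-d/d\zeta)^k [\zeta^j v]$.

```latex
\begin{align*}
% (1.5) multiplied by x^3
& x^3 w'''' + 2x^2 w''' - (1-\alpha)\,x\,w'' + (1-\alpha)\,w' + x^3 w = 0 \\
% term-by-term transcription under the integral sign
& -(\zeta^4 v)''' + 2(\zeta^3 v)'' + (1-\alpha)(\zeta^2 v)' + (1-\alpha)\,\zeta v - v''' = 0 \\
% expand: (\zeta^4 v)''' = \zeta^4 v''' + 12\zeta^3 v'' + 36\zeta^2 v' + 24\zeta v, etc.
& (1+\zeta^4)\,v''' + (12-2)\,\zeta^3 v'' + \bigl(36-12-(1-\alpha)\bigr)\zeta^2 v'
  + \bigl(24-12-3(1-\alpha)\bigr)\zeta v = 0 \\
& (1+\zeta^4)\,v''' + 10\,\zeta^3 v'' + (23+\alpha)\,\zeta^2 v' + (9+3\alpha)\,\zeta v = 0
\end{align*}
```

The last line agrees with the coefficients 10, $23+\alpha$ and $9+3\alpha$ of (3.2).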


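To find the solution relative to $\zeta = \infty$, let

$$ \zeta = \beta r \tag{3.3} $$

where

$$ \beta = \exp(3\pi i/4) . \tag{3.4} $$

Then the equation (3.2) becomes

$$ (r^4-1) \frac{d^3 v}{dr^3} + 10 r^3 \frac{d^2 v}{dr^2} + (23+\alpha) r^2 \frac{dv}{dr} + (9+3\alpha) r v = 0 \tag{3.5} $$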

This equation has a regular singular point at $r = \infty$. The indicial
numbers $\lambda_m$ ($m = 0, 1, 2$) at $r = \infty$ are:

$$ \lambda_1 = 2 + \mu, \qquad \lambda_2 = 2 - \mu $$

where $\mu$ is given by (2.1). The solution corresponding to the indicial
number $\lambda_m$ is:

m 


(m)  -X  -Vi 
^  ^ 


(3.6) 
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vhere 


n-1 


p=0 


( hp+X^+2-u ) ( hp+X^+2+p ) 


-1 


(3.T) 
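The indicial numbers at $r = \infty$ can be verified directly from (3.5). In the sketch below, inserting $v = r^{-\lambda}$ into the leading (large $r$) part of (3.5) gives a cubic in $\lambda$. The identification $\mu = \sqrt{1-\alpha}$ is an assumption here, since (2.1) is not reproduced in this part of the text, but it is what the roots $2 \pm \mu$ require. The cubic also has a third root, $\lambda = 3$.

```python
import math

def indicial_cubic(lam, alpha):
    """Coefficient of r**(1 - lam) when v = r**(-lam) is inserted into
    r**4 v''' + 10 r**3 v'' + (23 + alpha) r**2 v' + (9 + 3 alpha) r v."""
    return (-lam * (lam + 1) * (lam + 2)
            + 10 * lam * (lam + 1)
            - (23 + alpha) * lam
            + (9 + 3 * alpha))

alpha = 0.36                 # illustrative value
mu = math.sqrt(1 - alpha)    # assumed form of (2.1)
roots = [3.0, 2 + mu, 2 - mu]
cubic_residuals = [indicial_cubic(lam, alpha) for lam in roots]
print(roots, cubic_residuals)
```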


The contour $L$ must be such that the point $\zeta = \beta$ is a branch point
of $v(\zeta)$. This condition is satisfied by the series with the indicial
numbers $2 \pm \mu$, as shown by (3.10) below. These two series can be expressed
by means of hypergeometric series $F(\ ,\ ;\ ;\ )$ as


y^(r)  =  F  (l++y)/l|;  (2+^y)/2 


;  . 


(3.8) 


The hypergeometric series are summed up by use of the formula

$$ F(a, \tfrac{1}{2} + a; 2a; z) = 2^{2a-1} (1-z)^{-1/2} \left[ 1 + (1-z)^{1/2} \right]^{1-2a} \tag{3.9} $$


fe  =  1  or  2,  reduces  to 

-1 

Vj^(r)  =  (r»^-l)  ^ 


[Handbook  (Ref.  5)  p.  556,  Formula  (15.1.1^)].  Thus  Vy{v)  ^  where 

(l/2)(i’^  +/?^)j  (3.10) 

This equation shows that $\zeta = \beta$ is a branch point. Formula (3.9) can
be proved by showing that the one on the right-hand side satisfies the
hypergeometric differential equation of the one on the left-hand side
and also that they satisfy the same initial conditions at $z = 0$.
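Formula (3.9) can also be checked numerically by summing the Gauss series on the left and comparing it with the closed form on the right. This is a sketch with illustrative names and sample values; $a = 0.7$ corresponds to $(2+\mu)/4$ with $\mu = 0.8$, a value picked only for the example.

```python
import math

def hyp2f1_series(a, b, c, z, terms=400):
    """Partial sum of the Gauss series F(a, b; c; z); adequate for |z| well below 1."""
    total, term = 0.0, 1.0
    for n in range(terms):
        total += term
        term *= (a + n) * (b + n) / ((c + n) * (n + 1)) * z
    return total

def closed_form(a, z):
    """Right-hand side of (3.9): 2**(2a-1) (1-z)**(-1/2) [1 + (1-z)**(1/2)]**(1-2a)."""
    root = math.sqrt(1 - z)
    return 2 ** (2 * a - 1) / root * (1 + root) ** (1 - 2 * a)

a, z = 0.7, 0.5
series_value = hyp2f1_series(a, a + 0.5, 2 * a, z)
closed_value = closed_form(a, z)
print(series_value, closed_value)
```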

Suppose  that  v^{r)  on  one  of  the  branch  of  L  in  Figure  1  is 

given  by  (3.10).  Then,  vAr)  on  the  other  branch  A  B  of  L  is  given  by 
“1  r  _  1  ±u/2 

-(r**-l)  ^  (l/2)(jf’^+/r'‘-l) 


Thus  one  finds  the  integral  solution, 
F(, 


=  J  +|?-l)  ^  +  (r^ 


Bxr  ,  k  ^  2. 
e  (r  -1)  dr 


(3.11)

The fundamental solutions $w_1(x)$ and $w_2(x)$ are given by the real and
imaginary parts of $F(x)$,

$$ F(x) = w_1(x) + i\, w_2(x) \tag{3.12} $$

4. Integration of the Integral Solution

We shall integrate (3.11) to a linear combination of $f_0(x)$, $f_1(x)$,
$f_2(x)$ and $f_3(x)$. The first step for this goal is to change the range
of integration in (3.11). Introduce a complex variable $z = \beta r$, where
$\beta$ is given by (3.4). Use of $z$ transforms (3.11) to

l>  .  .  r—r~\  -  ^  / 

Fix)  = 


?  2  +y-Ai) 

6 


-1 


zx  ^-1,  U  - »  2  - 
e  $  (-3  -1)  dz 


(1^.1) 


The  range  of  integration  B  v  oo  g  in  (l+.i)  shall  be  changed  to  B  0 

and  0  "v  -  <=o.  Thus  (h.l)  becomes 
o 


F(x)  = 


— oo| 


l-lf 


k-2.2.,nr' 


-1 


2a;  .-lo-l/  2  , 

e  ^  B  iz  +1)  dz 

ih.2) 


In the above equations, quantities inside the square roots are chosen to
be positive in order to insure correct forms in the respective ranges.
Letting $z = \beta r$ in the first integral and $z = -r$ in the second integral,
(4.2) transforms to a summation of normal forms of integration,

F(x)  =  ih^ix)  +  ih^ix)  -  «»exp(-vnrt/lj)  -  Bexp(y;Ti/l^)  (x) 


(U.3) 
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and 


00 


00  ^ _ 

ix)  =  +l) 


-xr  /  \  2  , 

e  (s’  +1)  dz* 


(i^.5) 


Expansion  of  exp($xz>)  transforms  (it.li)  to  power  series  of  a:, 
hy{x)  =  +3a;  h  ~  (ga:)^  +  ...  (i^.6) 


where 


h  (k) 

n  = 


f  (>•'  * 


n,^  hs  2 
r  (l-*r  )  or 


(h.7) 


Integration  of  (^+.7)  will  be  carried  out  later.  Integral  (H.5)  trans- 
fonns  to  power  series  of  x 

_  .  (1)  .  .  (1)_  .  „  (1)„  l+P  .  .  2  ^  n,  « 


+  X  +  . ..  (it. 8) 


f  \  (2)  ^  (2)  l-\i  ^  (2)  ^  (2)  2  ^ 

92^^^  =  9q  +  9^  ^  ^  +  92  ^  * 


{h.9) 


as  explained  in  the  next  section.  Thus  one  finds  F(a:)  in  the  following 
form. 


F(a:)  =  5  +  +  B  x  +  +  B  x^  +  ... 

o  u  -  1  y  2 


where 


o  \  o  o  / 


(yiTti  yiTT^  \ 

~  h  (1)  ^  k  (2) 

e  ®  ' 


-a(, 


g  +  e  g 

ynri  piri 

i+  _(i)  ,  ^  . 


+  e  / 


=  i( 


8  ( 


yTO  \xv^ 

~  ^  (i)  .  ^ 

e  g'  +  e  g 


Mil  \ 

^  4^ 


7 

b'*^>  =  -  6e  '■ 

y  »y 


(k.lO) 

(^4.11) 

(4.12) 

(4.13) 

(4.14) 


In this calculation we tentatively assume that $0 < \alpha < 1$. Series are
arranged in the ascending order on this assumption. The formulas for the
values of $\alpha$ outside the range $0 < \alpha < 1$ will be derived from the formulas
in this range. Note that the entries in (4.10), except $B_1 x$, are the first
terms of $f_m(x)$, where $m = 0, 1, 2, 3$. The first-order term, $B_1 x$, is not
contained in any of $f_m(x)$; it is proved later that $B_1 = 0$. The entries
in (4.10) are sufficient to express $F(x)$ as a linear combination of
$f_m(x)$.

ha.  Formulas  of 
- - 

We  shall  give  the  integral  forms  of  by  successively  developing 

(4.5) into series of x. Integration of these formulas will be carried
out in the next section.

Letting  x  =  0  in  (^.5)>  one  finds  : 


4., s  2  .4 


r.%1)"  (pVl) 


(4a. 1) 


To  find  ,  the  formula 


5'o(^)  -  g 


-1)  (r%l)  ^  dr 


shall  be  transformed  by  introducing  5  =  rx  to 


00 


(C^+Zs^+xS^  (e“^-l)  (5^+a;^)  ^  d? 


( 4a .  2 ) 


Letting  x  =  0  inside  the  integral,  one  finds 


(e  ^-1)  d? 


(4a. 3) 


(2)  '  1 

To  find  ,  multiply  x  ^  on  (iia.3)  and  subtract  it  from  {ka,2). 


Thus  one  finds 


00 


\{M~  ^  J  5^-4  (.-^-1)  dC 


Letting  5  =  rx,  this  transforms  to 


-/♦i 


-rx 

(2»)r  ^ ^  dr 
\  / 


(Ua.U) 


where 


(^^(r)  =  (r^+/?+i)^  (Al)  ^  -  2^ 


{^a.5) 


Because of the inequality $1 \ge (1 - e^{-u})/u \ge 1 - u/2$ the integrand of (4a.4)
is uniformly bounded. One can, therefore, let $x \to 0$ inside the integral.
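The elementary bound used here can be spot-checked numerically. The sketch below tests $1 - u/2 \le (1 - e^{-u})/u \le 1$ on a range of positive $u$, together with a companion bound $-1/2 \le (1 - u - e^{-u})/u^2 \le 0$ of the kind needed for the next coefficient. The exact form of that second inequality is partly illegible in the text and is an assumption. The helper names and sample points are illustrative.

```python
import math

def first_ratio(u):
    """(1 - exp(-u)) / u, computed with expm1 for accuracy at small u."""
    return -math.expm1(-u) / u

def second_ratio(u):
    """(1 - u - exp(-u)) / u**2, assumed form of the second bounded quantity."""
    return (-math.expm1(-u) - u) / u ** 2

samples = [10.0 ** e for e in range(-3, 3)]   # u = 0.001, 0.01, ..., 100
ok_first = all(1 - u / 2 <= first_ratio(u) <= 1 for u in samples)
ok_second = all(-0.5 <= second_ratio(u) <= 0 for u in samples)
print(ok_first, ok_second)
```

Both quantities stay bounded uniformly in $u > 0$, which is what permits taking the limit $x \to 0$ under the integral sign.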


Thus 


(2).  . 


(j)l(r)  r  dr 


{i+a.6) 


To  find  9^2  ’  multiply  a:  on  (i+a.6)  and  subtract  it  from  (UaA).  Thus 

one  finds 

* 


-  f 


/  ^  2  -  1  +  ra: 

(r)  r  - 2"^ -  dr 

r  a; 


Because of the inequality $0 \ge (1 - u - e^{-u})/u^2 \ge -1/2$ the integrand of the
last integral is uniformly bounded. One can, therefore, let $x \to 0$ inside
the integral. Thus


=  I  j  ♦ifi 


r)  r  dr 


(lia.T) 
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To  find  ,  one  may  simply  differentiate  g^{x)  in  (J+.5)  with 
regard  to  x  and  let  a:  =  0  in  the  result.  Thus  one  finds 


oo 

1  =  -/  +  1 


)  ^  [r/(r^+l)  dn 


(lta.8) 


To  find  ,  use  (i*a.l)  and  (i*a.8)  to  derive  the  formula 


-  g 


(1)  Jl) 


S'!  ^ 


2.1k 


u. 

2  /  -r-x 


^  +  l)  (e  -  1  +  vx)  (r  +1)  or 


Letting  r  =  rx,  this  becomes 


J"  ^  U^+j^i^+x^)  ^  (e  ^-1+C 


(lta.9) 


Letting  x  =  0  inside  the  integral,  one  finds 


^ 


5"^  ^  (e  ^  -  1  +  g)  dC 


(ija.lO) 


?o  find  ,  -use  (iia.9)  and  (4a.l0)  to  derive  the  formula 

'i(»)  -  x"-- 


(C^+X**)  ^  (S^+A^+xS  ^  -2  ^  r-W-2 


(e~^-l+C)  dC 


Letting  5  “  this  transforms  to 


^^(r)  ^ *  dr 


(Ita.ll) 


where 
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HlCM 


(Al)  ^  (r^+/Al)  ^  -2  ^  p  ^  ^ 


Because  the  integrand  of  (Ua.ll)  is  uniformly  hoiinded,  one  can  let  m  ->  0 


inside  the  integral.  Thus  one  finds 


If 


r  dr 


{lta.l2) 


4b. Evaluation of $g_n^{(k)}$

The independent variable $\eta$ introduced by

$$ r^2 + \sqrt{r^4+1} = \eta^{-1/2} $$

is useful for the following integrations. This transforms to

$$ r^2 = \frac{1-\eta}{2\eta^{1/2}}, \qquad \sqrt{1+r^4} = \frac{1+\eta}{2\eta^{1/2}}, \qquad r\, dr = -\frac{1+\eta}{8\eta^{3/2}}\, d\eta \tag{4b.1} $$
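The change of variable in (4b.1) can be verified numerically. The sketch below assumes the substitution $r^2 + \sqrt{r^4+1} = \eta^{-1/2}$ with $0 < \eta \le 1$ and checks the two algebraic relations and the differential $r\,dr$ at one illustrative value of $\eta$. The function names and the step size are arbitrary choices.

```python
import math

def r_squared(eta):
    """r**2 = (1 - eta) / (2 sqrt(eta))."""
    return (1 - eta) / (2 * math.sqrt(eta))

def root_term(eta):
    """sqrt(1 + r**4) = (1 + eta) / (2 sqrt(eta))."""
    return (1 + eta) / (2 * math.sqrt(eta))

def r_dr_over_deta(eta):
    """Factor in r dr = -(1 + eta) / (8 eta**1.5) d(eta)."""
    return -(1 + eta) / (8 * eta ** 1.5)

eta, step = 0.3, 1e-6
r2 = r_squared(eta)
defining_gap = r2 + math.sqrt(r2 ** 2 + 1) - eta ** -0.5
root_gap = math.sqrt(1 + r2 ** 2) - root_term(eta)
# r dr = d(r**2) / 2, so compare with half the numerical derivative of r**2
numeric = (r_squared(eta + step) - r_squared(eta - step)) / (2 * step) / 2
jacobian_gap = numeric - r_dr_over_deta(eta)
print(defining_gap, root_gap, jacobian_gap)
```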

Substituting  these  in  (^a.l),  one  finds 

b{\  . 

Eq.  (4a. 8)  is  similarly  integrated  to 

Jl)  -  .-1 


(4b. 2) 


(4b. 3) 


Use  of  q  transforms  (4a. 6)  to 

i2)  -  1  r  (,  _  „ 


■ 


1  -  (1  +  n)(i  -  n) 


By  letting  1  +  n  =  2  -  (l-n),  the  last  integral  is  divided  in  two  por- 

(2) 

tions;  thus  g^  becomes 
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=  -  i  r  -  1  7 

4  *^1  2  ^2 


where 


•^1 ' 


/  [i  -  -  n)^]  ^ 


dn 


and  the  remaining  integral  simply  integrates  to 
^2  =  -  B{u/2,  1  -  u/h) 

_  E 

After  partial  integration  of  n  1+  ,  integrates  to 

•^2  “  2B(u/2,  1  -  p/H) 

Thus  one  finds 
JB)  _  -1 

^1  ^  {k\>.k) 

Combining  (itb.3)  and  (ijb.it)  the  result  may  be  shown  with  a  single  formula, 

Jk)  _  T  -1 

^  (itb.5) 

After  partial  integration  of  (lta.3)  integrates;  after  two  times 

of  partial  integrations  of  ^  (Ita.lO)  integrates;  the  two  results 

are  shown  here  with  a  single  formula: 


=  +  2*^^^ 


.  =  +  2  [p(l+y)]“^  r(l+p) 


(itb.6) 


We  express  in  (Ua.T)  and  in  (i»a.l2)  with  a  single  formula. 


ik) 


CO 

^  2/ 


(r^+l)  ^  {r^  +  ^**+1)  -2 


,+p/2  ^-2+p 


2 

r  dr 


Use  of  p  changes  this  to 

i 

•  1 


„(k)  _  1 

^2  “7^ 

8v^ 


L 


1  -  (1  +  n)(l  -  Ti) 


__  y 
-1+  2 


5  +  y 

n  ‘  (1  -  dn 
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Letting  l  +  ii='2-(l-n)>  this  can  he  divided  in  tvo  integrals. 


-1  -1 

gW  =  (8/2) 


where  ^  p 

=  j  1  -  ^(1  -  r\)*  n 


1  +  li 

-  V  (1-n)^/^  dn 


^2  = 


and  the  remaining  integral  simply  integrates  to 

fl  +  v  3  -  A 

2~  ’  k  I 

After  the  partial  integration  of  n  ^  ^  integrates  to 

*1  '  j  ®  (2  •  ^) 

Thus  one  finds 

\b(|  ,  3-f^) 


1*C.  Integration  of  h 


(k) 


n 


Let 


r  =  c 


Then 


'^  =  (1  +  C^)/(2C) 


and 


Jl  -  =  i  (1  -  c)^/(2c) 


rdr  =  -  l)/(^C^)  d? 


fth.T) 


1) 
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Substituting  these  in  (h.f),  one  get 


_  2±1  ^  li 

e  ^  ^  (1  t  2 


(i*c.2) 


For  n  -  1,  this  integrates  to 

_  r  ,  •  ,  ^  f. 


+  1  -  exp(+lJ7Tt/i4  ) 


(^c . 3) 


To  integrate  (iic.2)  for  n=0  and  2,  it  is  noted  that 

n  n 

is  real.  To  show  this,  let  w  coe6  in  {li.Tj  to  set 


h'l)  t  i,(2) 

n  n 


n-1 

2 

(cos0)  cosliG  d0 


which  is  real.  Divide  the  contour  of  (^c.2)  in  two  parts. 


=  i2 
n 


2  Uk)  ^  Jk) 
\  n  '  n 


where 


—  iL 

+  2  2 


7  _  £  ILti 
•  +  2  ~  2 


Letting  ^  -  i/x^  the  latter  integrates  to 


=1  exp(-;;-^(+y+i-n  I  r  (  r 

Transforming  the  Gaiaa-functions ,  this  yields 
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itd.  Fundamental  solutions  for  0  <  a  <  1. 

Substituting  (i*b-2)  and  (lic.it)  into  (l+.ll),  one  gets 
B  =  (2/2^)-^  (Ud.l) 

O  4  4 

Substituting  (lib.  5)  and  (lie. 3)  into  (4.12),  one  gets 

5^  =  0  (lid.  2) 

Substituting  (lib. 7)  and  (lie. 5)  into  (4.13),  one  gets 

B^  =  t  v/27  {!-/)]  (4d.3) 

Substituting  (^b.6)  into  (U.lU),  one  gets 

Ji 

=  +  |y(l-y^)  2  r(2+y)  exp  (4d.4) 

Thus  one  finds 

Fix)  =  B^  f^ix)  +  B^^ix)  +  (^d-5) 

When $0 < \alpha < 1$, functions $f_m(x)$ ($m = 0, 1, 2, 3$) are real. Therefore,
fundamental solutions $w_1(x)$ and $w_2(x)$ are found by decomposing the
coefficients into real and imaginary parts.


Thus 


^i(^)  =  fAx)  +  fix)  +.  f.  fix) 


(  4d .  6 ) 


^  +  Q,  foM  + 


I^(x)  +  +  q^  /^U) 


vh  ere 


Pq  =  (2/^)  ■ 


4  ^  '  4  ' 


(i+d.T) 


(iid.8) 


(itd.9) 


pi -I  +  2  /“\ 

+  pfl-p  )  2  ,  r(2+ii)  cos^^ 

4 


(4d,10) 


a,  =  +  111 i”  1  _,,^'il  o'^  2 


yll-y  ;  2  r(2+y) 


5. Fundamental Solutions for $\alpha = 1$

When $\alpha = 1$, (1.5) reduces to

$$ \frac{d^4 w}{dx^4} + \frac{2}{x} \frac{d^3 w}{dx^3} + w = 0 $$

(3?y) 


(ijd.ll) 


Nevel (1961) gave the Fuchsian type solutions of this equation with the
notations


nev^(5:) 


(-1)"  '•<p 

^^'^(2n).'  r(n+^ 


riev^(a.’)  =  ^  ^ 


V  (-I)"" 

^  ''(„<|)  tn+S) 


ix)  = 


Z 


0  l4-"'‘(2n+l).’  r(n^-f) 


r(f)  ]2 


and 


integral (3.1). The transformed differential equation is

$$ (1 + \zeta^4) \frac{dv}{d\zeta} + 2 \zeta^3 v = 0 $$

and one finds

$$ v(\zeta) = (1+\zeta^4)^{-1/2} $$

Therefore the complex solution $w(x)$ for $\alpha = 1$ is found in the integral
form


The contour $L$ is the one shown in Figure 1. To find the fundamental
solutions in the form of the linear combination of the nev functions,
J. Dieudonne (1958), as explained in Nevel (1968), expanded (5.1) into
power series in the neighborhood of $x = 0$, and determined the first few
coefficients. The fundamental solutions thus found are denoted here by


N  ^  N 
and  Wg 

w^(a;)  = 


(14/^)  ~  ^  (•^)  nev^ix)  -  ti(2»^) 


neV^ix) 


( 2\^) 


3 


nev^ix) 


and 

2  2 

w^(x)  =  (8A)-^  ^  (^)  nev^ix)  -  )~^  ^  (f)  nev^{x)  +  nel^{x)  - 

_  (1  _  Y  +  log  /2)  nev^ix) 
where $\gamma$ is Euler's constant 0.5772156.

We shall show in the following that $w_1$ and $w_2$ in (4d.6) and
(4d.7), respectively, give

lim  -u?  (a:)  =  w^(x) 

a^l  J-  2 

lim  w  (.r)  =  wh-)  -  /2  w/(a;) 

a-^1  2 

To  show  this,  note  that 

Pq  f  (^)  =  (2/^)  ^  ^  (h  nev  (x) 

a-^1  “  °  4  o'  ■' 

2 

lini  f^(x)  =  (2/27)-^  ^  (|)  nev^{x) 


G^l 


neVj^ix) 


We  shall  prove,  therefore,  that 


(h  4^^^  P2  4^^^)  -  >^  (1  -  Y  +  logv^  +  J)  7tev^{x)  +  /2nel  (x) 


li-  4(x)  .  f^U)) 


(5,  2) 


(1  -  Y  +  logv^  -  4  ?iev^(x)  -  /2  nel^ix) 


(5.3) 


The  left-hand  side  of  (5.2)  hecomi 


(Px  4'->  -  p.  4(-))  -  Z  ^  Xi™  i  ( 

v,-n  4  n!  U->0  ^  \ 


li™  -  -  0  in)  +  C  in] 

h->o  ^  \  / 


where 


Cj.(n}  =  2  r(2+y) 


r(|(2+-^})  r(^(3+p)j  r(|(5;;;p)) 
r^n+|(2+p))  r(n4(3+p))  r(nj^(5+u;) 


cosl^dl 
X  4 


Taking  the  limit,  one  finds  that 


lim  -f-C^{n)  +  C^{n)  j  = 

y->o  V 

-  /2  L(f)  (f)  I 

I  n  n\ 


n 


1  -  Y  -  log^  +  ^  + 

/2 


(!4P+1  *  2p  kp 


P=1 


n 


where  we  have  introduced  the  convention  that  the  s'oinination  ^  di-cippec^rs 

p-1 

when  n  =  0.  This  equation  proves  (5*2). 


The  left-hand  side  of  (5-3)  becomes 

Ln+1 


I  hn  , 
4  nl 


lim  —  Ls^in)  + 


lim  f^{x)  +  ^2  =  X] 

'  n=0 

where  5^  (n)  can  be  given  from  (n)  by  replacing  cos  ((3+y)TT/!4) 
with  sin  ((3+y)TT/l).  Taking  the  limit,  one  finds  that 

'  V 


lim 

y*^o 


/2  n!  (|)  (f) 

n  Yl 


-1 


n 


1  -  Y  -  log^  ■"  t 
V2 


n 


X](^P+l  2p  ■*■  itp-l) 


p=l 


where, by convention, the summation Σ_{p=1}^{n} disappears when n = 0. This proves (5.3).

6. Fundamental Solutions for a = 0.
When a = 0, (1.5) reduces to

\[ \left( \frac{d^2}{dx^2} + \frac{1}{x}\frac{d}{dx} \right)^2 w + w = 0 \]



as  may  be  derived  by  putting  =  0  and  -  0  in  (1.2).  This  equation 


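can be decomposed into two equations,

\[ \left( \frac{d^2}{dx^2} + \frac{1}{x}\frac{d}{dx} - i \right) w_1 = 0 \tag{6.1} \]

and

\[ \left( \frac{d^2}{dx^2} + \frac{1}{x}\frac{d}{dx} + i \right) w_2 = 0 \tag{6.2} \]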
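
The decomposition into (6.1) and (6.2) can be checked in one line. The shorthand Δ for the radial operator is introduced here only for illustration and is not a symbol of the original text; the sketch assumes the fourth-order equation for a = 0 reads Δ²w + w = 0.

\[ \Delta \equiv \frac{d^2}{dx^2} + \frac{1}{x}\frac{d}{dx}, \qquad (\Delta - i)(\Delta + i)\,w = \Delta^2 w + i\,\Delta w - i\,\Delta w - i^2 w = \Delta^2 w + w . \]

Because the two factors commute, any solution of either second-order equation is also a solution of the fourth-order equation.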


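The solutions of the two equations satisfying the boundary condition (1.6) at x = ∞ are

\[ w_1 = \operatorname{ker} x + i \operatorname{kei} x \tag{6.3} \]

and

\[ w_2 = \operatorname{ker} x - i \operatorname{kei} x \tag{6.4} \]

giving the fundamental solutions ker x and kei x. Thus

\[ w = A \operatorname{ker} x + B \operatorname{kei} x \tag{6.5} \]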


We shall prove that w_1(x) in (4d.6) and w_2(x) in (4d.7) satisfy

\[ \lim_{a \to 0} w_1(x) = \sqrt{2}\,\operatorname{ker} x \tag{6.6} \]

\[ \lim_{a \to 0} w_2(x) = \sqrt{2}\,\operatorname{kei} x \tag{6.7} \]


First we note that

litn  f  (x)  =  lim  f  (x) 


ILm  f  ix)  =  lim  f  (x)  =  h 


Letting 


A  =  1  _ 


We transform (4d.6) to


1  .2 


1  .  Att 


1  %(l+y)  ^  r(2-u)*  ^  sir 


A  i,  ••'2 


-I-  i  \  —  r  /  2~A  \  T  /  \ 

A  )  ,r  ^  ("IT^/o  + 


Letting λ → 0,


^  ‘•'o  (l-A)(2-A)  ^  r(3-A)  cos  ^  •/. 
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o  3 

—  beia:  +  /2(log2-a)  bera;  +  /2  gx  ^ 


lim  w.  ^ 

X^o  ^  2^2 

Substituting from (2.4) and (2.6), one finds that


lim  -  rr^)  =  -  logo:  berx 


X->o 


3X  3X 


E 

n=l 


(-1)  ^ 


E 


'=1 


which  proves  (6.6). 

We transform (4d.7) to


w. 


^  2  ^  r(2+vi)*  f  sin^  •/,  + 


y(i+p) 


4 


X 

r/2+Xx  r,ii-x 


1^ 

2 


v/^(  2-X) 


(^)  -  ii_xU2-T)  2  ^  rd,).)  cosij  -4 


Letting λ → 0,

1  im  herx  -  v^(l‘-Y+log>'^)  heio:  + 

X^  2  2^2 

Substituting  from  (2.5)  and  (2.6),  one  finds  that 


,Vi  »4, 

r-  -  ti' 

2/2  X^o 


lim  ( 
Xt^o 


*4  !4 

3X  "  3X 


•)  =  Ulogx  beix  -  1+  ^ 


(-1) 


n  l+n+2 


2n+l 


2n+2 


^(2n+l):)^  ^ 


n=l  '  V  /  P=2 

which proves (6.7).

6a.  Eigenvalues  for  a  =  0. 

When a = 0, no horizontal pressure works on the plate, and buckling should not take place under any boundary conditions. We shall prove below that this is true under the boundary conditions (1.7), (1.8), and (1.9).

The following formulas are needed for the proof. Substituting either (6.3) into (6.1) or (6.4) into (6.2), one finds the relations

\[ \operatorname{ker}'' x + x^{-1} \operatorname{ker}' x + \operatorname{kei} x = 0 \tag{6a.1} \]

and

\[ \operatorname{kei}'' x + x^{-1} \operatorname{kei}' x - \operatorname{ker} x = 0 \tag{6a.2} \]

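We shall prove that no positive number x_0 can satisfy the clamped-edge condition (1.7). The determinant of (1.7) is given by

\[ D_1 = \begin{vmatrix} \operatorname{ker} x & \operatorname{ker}' x \\ \operatorname{kei} x & \operatorname{kei}' x \end{vmatrix} \]

when (6.5) is used. Differentiating D_1 and using (6a.1) and (6a.2), one finds the differential equation

\[ D_1' + x^{-1} D_1 = \operatorname{ker}^2 x + \operatorname{kei}^2 x \]

Solving this equation under the boundary condition that D_1 = 0 at x = ∞, one finds

\[ D_1 = -\frac{1}{x} \int_x^{\infty} \zeta \left( \operatorname{ker}^2 \zeta + \operatorname{kei}^2 \zeta \right) d\zeta \]

which is negative for any positive x, proving our contention.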

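We shall prove that no positive number x_0 can satisfy the simple-edge condition (1.8). The determinant of (1.8) transforms to

\[ D_2 = \begin{vmatrix} \operatorname{ker} x & (1-\nu)x^{-1} \operatorname{ker}' x + \operatorname{kei} x \\ \operatorname{kei} x & (1-\nu)x^{-1} \operatorname{kei}' x - \operatorname{ker} x \end{vmatrix} \]

This equation transforms to

\[ D_2 = -\frac{1-\nu}{x^2} \int_x^{\infty} \zeta \left( \operatorname{ker}^2 \zeta + \operatorname{kei}^2 \zeta \right) d\zeta - \left( \operatorname{ker}^2 x + \operatorname{kei}^2 x \right), \]

which is negative for any positive x, proving our contention.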

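We shall prove that no positive number x_0 can satisfy the free-edge condition (1.9). The determinant of (1.9) transforms to

\[ D_3 = \begin{vmatrix} (1-\nu)x^{-1} \operatorname{ker}' x + \operatorname{kei} x & -\operatorname{kei}' x \\ (1-\nu)x^{-1} \operatorname{kei}' x - \operatorname{ker} x & \operatorname{ker}' x \end{vmatrix} \]

This equation transforms to

\[ D_3 = \frac{1-\nu}{x} \left( \operatorname{ker}'^2 x + \operatorname{kei}'^2 x \right) + \frac{1}{x} \int_x^{\infty} \zeta \left( \operatorname{ker}^2 \zeta + \operatorname{kei}^2 \zeta \right) d\zeta \]

which is positive for any positive x, proving our contention.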
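
The three sign statements of this section can be spot-checked numerically. The following is a minimal sketch, not the authors' program: it evaluates ker x + i kei x as K_0(x e^{iπ/4}) from the standard ascending series for K_0, forms the three determinants as 2×2 determinants of the edge conditions, and compares x·D_1 with the tail integral. The function names, the value ν = 0.3, and the truncation of the integral at ζ = 14 are assumptions chosen for illustration.

```python
import cmath
import math

EULER_GAMMA = 0.5772156649015329  # Euler's constant


def k0_series(z, terms=80):
    """Return K_0(z) and dK_0/dz from the ascending power series."""
    t = z * z / 4.0
    log_half_z = cmath.log(z / 2.0)
    i0 = di0 = s = ds = 0j            # I_0, dI_0/dt, psi-weighted sum, its t-derivative
    coeff = 1.0                       # 1 / (k!)^2
    power, prev_power = 1.0 + 0j, 0j  # t^k and t^(k-1)
    psi = -EULER_GAMMA                # psi(k+1)
    for k in range(terms):
        i0 += coeff * power
        s += psi * coeff * power
        di0 += k * coeff * prev_power
        ds += psi * k * coeff * prev_power
        prev_power, power = power, power * t
        coeff /= (k + 1) ** 2
        psi += 1.0 / (k + 1)
    k0 = -log_half_z * i0 + s
    dk0 = -i0 / z + (z / 2.0) * (-log_half_z * di0 + ds)
    return k0, dk0


def kelvin(x):
    """Return ker x, kei x, ker' x, kei' x for real x > 0."""
    rot = cmath.exp(0.25j * math.pi)
    k0, dk0 = k0_series(x * rot)
    d = rot * dk0                     # d/dx of K_0(x e^{i pi/4})
    return k0.real, k0.imag, d.real, d.imag


def determinants(x, nu):
    """D_1, D_2, D_3 for the clamped, simple and free edge at a = 0."""
    ker, kei, kerp, keip = kelvin(x)
    a = (1.0 - nu) * kerp / x + kei
    b = (1.0 - nu) * keip / x - ker
    d1 = ker * keip - kei * kerp
    d2 = ker * b - kei * a
    d3 = a * kerp + b * keip
    return d1, d2, d3


def tail_integral(x, upper=14.0, steps=2000):
    """Simpson estimate of the integral of z (ker^2 z + kei^2 z) from x to `upper`."""
    h = (upper - x) / steps
    total = 0.0
    for j in range(steps + 1):
        z = x + j * h
        ker, kei, _, _ = kelvin(z)
        weight = 1 if j in (0, steps) else (4 if j % 2 else 2)
        total += weight * z * (ker * ker + kei * kei)
    return total * h / 3.0


if __name__ == "__main__":
    for x in (0.5, 1.0, 2.0, 4.0, 8.0):
        print(x, determinants(x, nu=0.3))
```

The power series loses accuracy for large arguments, so the sketch is only meant for moderate x; a library implementation of the Kelvin functions would be the natural replacement in serious use.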

7.  Fundamental  Solutions  for  a  >  1. 

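For a > 1, μ defined in (2.1) must be replaced with μ = iκ, where

\[ \kappa = \sqrt{a - 1} \tag{7.1} \]

To compute Γ(x + iy), we use the formulas

\[ \left| \frac{\Gamma(x+iy)}{\Gamma(x)} \right|^2 = \prod_{n=0}^{\infty} \left[ 1 + \frac{y^2}{(x+n)^2} \right]^{-1} \tag{7.2} \]

and

\[ \operatorname{Arg}\,\Gamma(x+iy) = y\,\psi(x) + \sum_{n=0}^{\infty} \left[ \frac{y}{x+n} - \tan^{-1}\frac{y}{x+n} \right] \tag{7.3} \]

[Handbook (ref. 5), p. 256]. These formulas can be proved by use of Euler's formula for the Gamma function (ref. 9, p. 237).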

Using these formulas, coefficients of F(x) in (4d.5) become

-1 


B  =  ^(271)’  2  T^ih  n  [1  +  K^ihp+l)'^] 

O  p-Q 


(7.1+) 


4(2tt)  ^  r^(^)  n  [1  +  <'^(l+p+3) 

p=0 


-1 


B. 


a 


B 


=  7(aO"^  ^  ejcp(+ 


(7.5) 


(7.6) 


where R and Θ are defined by

\[ \Gamma(2 + i\kappa) = R \exp(i\Theta) \]

They are given by

\[ R = \prod_{n=0}^{\infty} (n+2) \left[ (n+2)^2 + \kappa^2 \right]^{-1/2} \]

and

\[ \Theta = \kappa(1 - \gamma) + \sum_{n=0}^{\infty} \left\{ \frac{\kappa}{n+2} - \tan^{-1}\frac{\kappa}{n+2} \right\} \]
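
The product for R and the sum for Θ can be evaluated by simple truncation. The sketch below is an illustration under stated assumptions, not the authors' routine: it reads R as the infinite product of (n+2)[(n+2)² + κ²]^(-1/2) and Θ as κ(1 − γ) plus the sum of κ/(n+2) − arctan(κ/(n+2)), and compares both with an independent reference value of log Γ(2 + iκ) obtained from the recurrence relation and Stirling's series. The function names and the truncation lengths are hypothetical choices.

```python
import cmath
import math

EULER_GAMMA = 0.5772156649015329  # Euler's constant


def modulus_and_phase(kappa, terms=200000):
    """Truncated product for R and truncated sum for Theta in Gamma(2 + i*kappa)."""
    log_r = 0.0
    theta = kappa * (1.0 - EULER_GAMMA)   # kappa * psi(2)
    for n in range(terms):
        m = n + 2.0
        log_r -= 0.5 * math.log1p((kappa / m) ** 2)
        theta += kappa / m - math.atan(kappa / m)
    return math.exp(log_r), theta


def log_gamma(z, shift=20):
    """Reference log Gamma(z) for Re z > 0: upward recurrence, then Stirling's series."""
    correction = sum(cmath.log(z + k) for k in range(shift))
    w = z + shift
    stirling = ((w - 0.5) * cmath.log(w) - w + 0.5 * math.log(2.0 * math.pi)
                + 1.0 / (12.0 * w) - 1.0 / (360.0 * w ** 3) + 1.0 / (1260.0 * w ** 5))
    return stirling - correction


if __name__ == "__main__":
    kappa = 1.5
    r, theta = modulus_and_phase(kappa)
    ref = log_gamma(2.0 + 1j * kappa)
    print(r, math.exp(ref.real))
    print(theta, ref.imag)
```

The product converges slowly (the truncation error in log R decays like κ²/(2N)), whereas the sum for Θ converges like κ³/(6N²); this is why many more terms are needed for the modulus than for the phase.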


Functions  /^(x)  and  are  real.  To  decompose  the  complex 

function  into  the  real  and  Imaginary  parts,  the  denominator  of 

(2.3), 

\[ (4n+1+\mu)(4n-1+\mu)(4n)(4n+2\mu) \]
is transformed to

\[ = 8n\left[ 2n(16n^2 + 4 - 5a) + i\kappa(32n^2 - a) \right] \]
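
The transformation of the denominator is a purely algebraic identity and can be confirmed numerically. The sketch below assumes μ = iκ with κ² = a − 1 (so that μ² = 1 − a), which is the relation the quoted result requires; the function names are hypothetical.

```python
def denominator(n, a):
    """Product (4n+1+mu)(4n-1+mu)(4n)(4n+2mu) with mu = i*kappa, kappa^2 = a - 1."""
    kappa = (a - 1.0) ** 0.5          # assumes a > 1, so kappa is real
    mu = 1j * kappa
    return (4 * n + 1 + mu) * (4 * n - 1 + mu) * (4 * n) * (4 * n + 2 * mu)


def transformed(n, a):
    """Right-hand side 8n[2n(16n^2 + 4 - 5a) + i*kappa*(32n^2 - a)]."""
    kappa = (a - 1.0) ** 0.5
    return 8 * n * (2 * n * (16 * n * n + 4 - 5 * a) + 1j * kappa * (32 * n * n - a))


if __name__ == "__main__":
    for n in (1, 2, 3):
        print(n, denominator(n, 2.0), transformed(n, 2.0))
```

Separating the result into the real part 16n²(16n² + 4 − 5a) and the imaginary part 8nκ(32n² − a) is what allows the series coefficients to be written in modulus-and-phase form.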

Thus  one  finds 


=  E  (-1)V°>  e*p(+ip  ) 

n=0 


where 


=  1 

o 


=  0 


(o) 


n 


er^{ni)~^ 


r  n 

n| 

Lp=i 

p  P 

p  (hp^  +  1 

-  ■^)^  +  (a-  l)(itp^  -  1 

n 


^  tan“^  !c(i4p^  “  f 
p=l  I  ‘ 


pihp^  +  1  -  |a) 


for  n  ^  1, 

Fundamental solutions w_1(x), w_2(x) and their derivatives are found by decomposing F(x) and its derivatives into real and imaginary parts. We formulated them (up to the third derivatives) as follows:
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PO|H 


00 


r^i  r  1-1 

i-A-  n 

2*^  p=0  L  i 


H  exp(KTr/tt)  (-if  P^'"^  c°s(e 


P 


j?  expC-Kir/H)  (-1)”  P^  X 

a<  JLmJ  ^ 


cos(62^P„-C) 


(T.T) 


=  i-^  n  1*-^  -s 


(kp+3) 


R  ex'p(  kt/U) 


V  (-1)”  p*"'  « 


1”' »’•"*"-”  =in(e,  -P„*C’) 


{ktt/^) 


£ 


2  "n  n 


(7.8) 


where 


^  +  iclog^  +  (1-y)<  +  (p+2 

p=0 


-  tan 


pfe) 


p(°)  [(Un.l)^.K^l 


\(Unf^  <2 1  2 

P^^^  [(Un-l)^  +  K^l  ^ 


=  tan  |K/(iin+l) 
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r 


J2) 

II 

-0- 

o 

H 

'  (3) 
^o 

^(3) 

+  — 

2 

+  tan~^[</(ljn)] 

+  If  -  tan~^K 

+  tan~^[ic/(itn-l)] 


for  n  >  1 


for n > 1
PART II. ASYMPTOTIC EXPANSIONS


Values of a series solution developed in PART I must overlap on a certain range of x with the values of an asymptotic expansion determined corresponding to the respective series solution. The series may be used for any x less than the overlapping range, and the asymptotic expansion may be used for any x larger than the overlapping range.


8. Asymptotic Expansion for 0 < a ≤ 1.

Using analytical continuation of the hypergeometric function in (3.8) from the range 1 < r < ∞ into the neighborhood of r = 1 (more exactly, into the range |1 - r²| < 1), one finds that the integrand in the contour integral solution (3.11), defined in the range 1 < r < ∞, is analytically continued to


,(^)  p/i(2+y),  ^(2-y);  |  ;  1-r^  + 

'  1  / 


+  V 


ik)  2rh 


r>-(r^-l)  ^  F(^(2+y),  ^(2-y);  |  ;  l-r 


(8.1) 


-1 


defined in the neighborhood of r = 1, where

=  _  2/^  ^(|(2+y))  ^  j|(2+y)] 

(8.2)

(8.3)

Double signs may not appear in the hypergeometric functions on the right-hand side of (8.1) because of their symmetric properties with regard to the first and second parameters.

Letting r = 1 + t and developing the hypergeometric functions on the right-hand side of (8.1) into power series, one can integrate (8.1) to a complex-form asymptotic expansion for 0 < a ≤ 1:
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Fix)  - 


where 


=  -d  +  2a)/k 

^2  =  (9  +  20a  +  ka^)/g6 

=  -  (3  +  a)/6 


^2  ~  ■*■  +  a)/l20 

Asymptotic expansions for w_1(x) and w_2(x) are given by the real and imaginary parts of (8.4), respectively.

9. Asymptotic Expansion for 1 ≤ a < 2.

A  form  of  asymptotic  expansion  for  a  >  1  is  found  by  letting 
«  -  ix  in  the  coefficients  of  e  (i  =  1,2).  i„ 

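Formulas (7.2) and (7.3) need to be modified to include the case x = 0. The modified formulas are:

\[ |\Gamma(iy)|^2 = y^{-2} \prod_{n=1}^{\infty} \left[ 1 + \frac{y^2}{n^2} \right]^{-1} \tag{9.1} \]

and

\[ \operatorname{Arg}\,\Gamma(iy) = -\frac{\pi}{2}\operatorname{sign}(y) - \gamma y + \sum_{n=1}^{\infty} \left[ \frac{y}{n} - \tan^{-1}\frac{y}{n} \right] \tag{9.2} \]

where sign(y) = +1 for y > 0 and sign(y) = -1 for y < 0.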

The result of the transformation becomes extremely simple:

+  y^^^  =  cos(iclog2) 

1  1 


o  O 


=  K  sin[tc{^  +  log*^)] 


(9-3) 


(9.M 


Substituting these into (8.4), the complex-form asymptotic expansion for a ≥ 1 is found.

Our numerical computation shows that this asymptotic expansion is effective only for a close to 1. We used this formula for 2 ≥ a ≥ 1.

10. Asymptotic Expansion for a ≥ 2.

Letting μ = iκ, the integral solution (3.11) transforms to

CO  ^3^ 

I  F{m)  =  /*  cos  [flog  +  >/^^l)  (Al)  ^  dm  (lO.l) 


Expanding the integrand in the neighborhood of r = 1 by letting
r = 1 + t, and using the approximations,

log(r^  +  =  2/t  + 


=  2/t  +  0(t) 

one  finds  the  integral  asymptotic  solution. 
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FU) 


CO 

I 


COs(K/t)  ^ 

/i 


To  evaluate  this  integral,  define  the  function. 


o^U)  =  i 


/ 


/t 


Then  (10.2)  becomes 


F(a:) 


6m 


+  G^ix)^ 


2 

Letting  t  =  %  ,  (10.3)  transforms  to 

2 


GAx)  = 


f  exp  6x(y+  ||)^  + 


K _ 

i*6x 


dS 


(10.2) 


(10.3) 


(10.1;) 


Define  2  by 


3;^  (C?f)^  = 


The  root  a  of  this  equation  satisfying  the  condition  that  the  real  part 

of  a  must  approach  positive  infinity  as  C  ®  is 

1 


a  =  exp(-  ^)  5  +  exp(-^)  |Km  2 

Use  of  a  thus  defined  transforms  (IO.3)  to 

1  -  “exp(5'n"i/8) 


5Tri.  1-4 


G,(x)  =  X  2 


2 

/  K  , 

+  -8> 


/ 


exp (-2  )  d2 


where 


—  5  1  ^ 

a.  =  +exp(-^t:)  2 


'k  ■  2' 

Transforming  the  contour  of  integration  to  the  sm  of  tvo 

a^'^'O  and  o'v+co^  one  finds 

G^(x)  +  G^{x)  =  A/x  exp^ic^/(86x)  +  ni/d^ 

Thus  one  gets 


contours , 


(10.5) 
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(10.6) 


F(x)  —  /ir/x  exp(3a;  +  ic^/(8&c)  +  iri/S) 

We use this equation for a ≥ 2.

11. Fundamental Solutions for Large a and Small x.

Our numerical computation shows that the overlapping range of the series solution and the asymptotic expansion moves to small values of x as the value of a increases. When a = 2, the series solution and the asymptotic expansion overlap in the neighborhood of x = 6. When a = 6, they overlap in the neighborhood of x = 1, showing that the fundamental solutions at this value of a are ineffective. For larger a, fundamental solutions must be transformed to a more effective form.


The following formulas were used for the transformation. For large values of y

r(x+iy)  ~  e 

and 


(11.1) 


r(xHy) 


r  {n+x-^y ) 


-1 


+  ^iri 

2  -n 

e  y 


(11.2) 


where x and y are real. These formulas can be derived by transforming the asymptotic expansions of the Gamma functions by using the assumption that y is large.

When x is small, the number of terms needed for the summation of series f_m(x) (m = 0, 1, 2, 3) in (2.4)-(2.6) is fairly small. Letting κ be large under this condition, formulas (11.1) and (11.2) may be applied to transform series f_m(x). Thus one finds

f  (x)  cos [(2k)  ^x^] 
o 


/^(x)  ~  2k  sin[(2K)“^x^] 


.  (11.3) 

(11. 1<) 
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and 


exp[+i(4K^)“^a;^]  (11.5) 


Also  one  finds 

1 

1 

/i  \ 

/l  \ 

— 

pCTT 

r(j(i+t<)) 

rMi-iK)) 

2 

klT  K 

e 

4 

(11.6) 

\  / 

1 

1 

r(^(3-iK)) 

2 

—  7T  K  e 

(11.7) 

and 

1 

1 

3  . 

Q  O 

-  '^TT 

+iK 

e 

r(2+iK)  ~ 

(27IK^)^ 

e  ^  (K/e) 

(11.8) 

Thus for extremely large κ and small x, one finds the complex expression


T{x) 

(11.9) 

where 

= 

1  1 
-  P  - 

(2Tr/K)  e 

(11.10) 

B  ^ 

1  •  13  1  . 

i2  2  (2^)2  ^  2  ^  it 

(11.11) 

C  = 

1,'  13  3. 

9  “  p  “  ■rnK-^K  . 

O  ^  f  OiT  \  ^  ^  ^  • 

-  ^  (27t;  K  e  K 

(11.12)
PART III. EIGENVALUES


12.  Computation  of 

Our numerical computation shows that the clamped-edge condition (1.7) and the simple-edge condition (1.8) do not yield any positive number x_0 as a root of the respective determinant equations. The free-edge condition (1.9) always yields one or more roots. We shall discuss below only the free-edge condition.

Define  operators 


L_  +  —  — 

2  X  dx 


(12,1) 


^  2 
M  ^  1  ^ 

7^  2 

dx  dx 


1-a  d 
'2  dx 


(12.2) 


Then the determinant D found by substituting (1.14) into (1.9) is given by

\[ D = \begin{vmatrix} L(w_1) & M(w_1) \\ L(w_2) & M(w_2) \end{vmatrix} \tag{12.3} \]


Roots x_0 thus found in the range 0 < a ≤ 2 are shown in Figures 2 and 3.

To discuss the neighborhood of x_0 = 0 in these figures, take the first term of the series f_m(x) (m = 0, 1, 2, 3), and approximate w_1(x) and w_2(x) in (4d.6) and (4d.7) with

^  -1  .  _  T  , 


w^(x)  =  Pq  +  ^  +  0  (x  ) 


(12.4) 


wAx)  =  qx^  +  <?,x  +  q„x  ^  +  0(x  '^) 


(12.5) 


Because  M(x^— *^)  =  0,  M(w^)  is  negligible  against  M(w2)*  Therefore  the 
root  of  (12.3)  is  given  by  L(u^)  =  0,  which  yields 
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1  \  (v-v)  r(l+y)  cos[{3+y)Ti/^] 

^  1  ~  ■■■' 

/2  7  (v+m)  r(l-y)  cos[  (3-y 


(12.6) 


This equation shows that the condition μ ≤ ν, i.e., a ≥ 1 - ν², must be met. When μ = ν, x_0 becomes equal to zero. Each curve in Figure 2, therefore, terminates at the intersection with the axis of abscissa, whose coordinate is a = 1 - ν².

For small μ, (12.6) becomes


-  Y 


1  1  2{ 

-  V  ^  3  M- 


^(3)  -  ^ 


(12.7) 


where ζ(3) is Riemann's zeta function. Our numerical computation shows that (12.7) gives close approximation over the entire lengths of the curves in the neighborhood of a = 1 - 0 in Figure 3.

To discuss the neighborhood of x_0 = 0 for the case 1 ≤ a ≤ 2, we used the complex form F(x) in (4d.5) with coefficients given by (7.4)-(7.6). Taking the first terms of f_m(x), one finds that


M(F)  =  25^  (l+K^)x  ^ 

Because  is  a  pure  imaginary,  the  real  part  of  M(F),  i.e.  M(Zi?^),  is 
negligible  against  the  imaginary  part  of  M(f),  i.e.  ¥x{w^).  Therefore 
=  0  is  equivalent  to  L(a?^)  =  0.  Equating  the  real  part  of  L(F) 
equal  to  zero,  one  finds  that  x^  is  approximated  by  the  root  of 
tan(a+Klnx)  = 

[(v-K  )  exp(|j^7r)  -  k(1+v)  exp ( --jnc tt )  ]  •  [k(1+v)  expC^ir)  + 

where 

00 

a  =  f  -  Klog/2  -  (!-«)<  -  -  tan"^ 

n=0  '  ' 


(v-K^)  exp(-^iT)] 
(12.8) 

(12.9)
For small κ, (12.8) reduces to
vH(a;)  +  1  +  V  -  ^  = 

H(ic)]  (12.10) 

where

H{x)  =  log[(/2)  a:j  -  1  +  y  (l2.1l) 

Our numerical computation shows that this equation gives close approximation over the entire lengths of the curves in the neighborhood of a = 1 + 0 in Figure 3.

Equation (12.8) shows that, if x_n is a root, then x_{n+1} given by

\[ x_{n+1} = x_n \exp(-\pi/\kappa) \tag{12.12} \]

is also a root. Therefore infinitely many roots exist in the neighborhood of x = 0. Roots x_0, x_1 and x_2 are shown in Figure 4 where κ is used as the ordinate. The solid lines cover the values we actually computed. They may be extended to the left of the solid lines by means of (12.10) and (12.12).
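
The reason infinitely many roots accumulate near x = 0 is that the left-hand side of (12.8), tan(α + κ log x), is periodic in log x with period π/κ, while the right-hand side does not depend on x. The sketch below illustrates this with arbitrary, hypothetical values of α, κ and the right-hand side; it is not tied to the actual coefficients of the paper.

```python
import math


def roots_of_tangent_equation(alpha, kappa, rhs, count=4):
    """A decreasing sequence of roots x_n of tan(alpha + kappa*log x) = rhs."""
    base = math.atan(rhs) - alpha
    return [math.exp((base - n * math.pi) / kappa) for n in range(count)]


if __name__ == "__main__":
    alpha, kappa, rhs = 0.3, 0.8, -1.7   # illustrative values only
    roots = roots_of_tangent_equation(alpha, kappa, rhs)
    for n, x in enumerate(roots):
        print(n, x, math.tan(alpha + kappa * math.log(x)))
    print("ratio", roots[1] / roots[0], math.exp(-math.pi / kappa))
```

Successive roots differ by the constant factor exp(−π/κ), so they form a geometric sequence converging to x = 0; the factor approaches zero rapidly as κ becomes small.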

The asymptotic behavior of the large roots can be found by using
F(x) in (10.6) to compute L(F) and M(F). Assuming that ζ defined by
C  =  <^/{hx^) 

is  of  the  ordinary  magnitude  for  large  x,  one  finds 
L(F)  _  F(x)  (e^  -  2C  + 

and 

MiF)  -  F(x)  rexp(ii)  -  c  exp(.  if)] 

Thus  one  discovers  that  there  are  two  asymptotic  roots. 


(12.13) 

(12.11+) 

(12.15) 


,  ,(l+v)Tr  vir 
^  8 


-  1  - 


(1+v)tt 


VTT 

32 
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range  0  ^  k  <  «>  using  v  (Poisson's  ratio)  as  parameter 


(12.16) 


X 


h 


where 


1 

2  K 


h 


2  +  /s 


(12.17) 


In these two equations, suffix h is defined by

h = k - 1, (12.18)

where the old convention for suffix k is still observed. The two lines in Figure 4 expressing the two equations in (12.16) are shown by the broken lines.


Asymptotic roots were also computed retaining all the terms in L(F) and M(F) that were found by letting F(x) be (10.6). Carrying out the computation of D as given by (12.3), one finds that the equation D = 0 reduces to

\  °  (12.19) 

n=0 

where 

=  (l+c)(l+c^)(l-l;e+C^)  (12.20) 

'  [(1-v)  +  Itc  -2C^  +  -  (3-v)c^] 

^2  =  -|+5(2-v)^  -  (6-v)^^  +  (l2-7v)c^ 

^  /2  [-(3+v)  +  2(9-litv)c  -  (15-I3v)c^] 


"  16  +3  (15-i6v)c] 

The  positive  roots  of  are  and  in  (l2.17).  The  solid  lines 
running  close  to  the  broken  lines  in  Figure  cover  the  values  of 
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a:  and  a:,  computed  for  v  =  0-5  by  use  of  (12.19).  The  values  of  m  and 
o  1 

in  the  range  a  ^  2  vere  computed  by  using  the  series  (7-7) 
and  (7.8). 

The  asymptotic  behavior  of  the  small  roots  can  be  found  by  using 
(11.9)  to  compute  L(F)  and  M(F).  One  finds  that 

M{F)  =  i  A  {kx)  ^  (l+K^)  (12 .21) 

Because A is real, M(w_1) is negligible against M(w_2), and D = 0 is equivalent to L(w_1) = 0. For large κ, this yields

X  =  /2  e 

which, however, is not small. Therefore small roots do not accumulate at point x = 0 when κ is large.

This conclusion does not yet exclude the possible existence of roots that are too small to be found with the asymptotic expansion (10.6) but too large to be found with the approximation (11.9). It is probably true, however, that roots x_n (n ≥ 2) become equal to zero at certain values of κ and do not extend indefinitely to large values of the ordinate.

Extension  of  the  curves  expressing  x^  (n  ^  2)  beyond  the  ordinate 
K  >1  /S  was  not  attempted.  Our  interest  was  originally  in  small  a, 
and  moreover  we  did  not  have  enough  time  to  have  series  improve 
for  the  case  u  >  1.  However,  we  believe  that  small  roots  are  not 
important  for  engineering  purposes  and  need  not  be  known  in  detail. 

13. Deformation

Forms of deformation corresponding to the roots were calculated in the range 1 - ν² ≤ a ≤ 2 by assuming the normalization

\[ w(x_n) = 1 \tag{13.1} \]

Tv70  cases,  (a  =  1,  v  =  0.3)  and  (a  =  ^4 ,  v  =  0. 3) ,  are  shovn  in  Figure 
5  and  6,  respectively. 

These forms of deformation have often been observed in laboratories and fields when floating ice plates are compressed. We are now convinced that buckling is frequently taking place.

Forms of deformation other than those shown in Figures 5 and 6 can be guessed by use of Figures 7 and 8, where the values of w_min (minimum depression) and x_min (defined by w_min = w(x_min)) determined for x_0 are shown. (See Figure 5 for the definition of x_min on a curve of deformation.) Values of a in these figures are restricted to 1 - ν² ≤ a ≤ 2. We did not compute them for the case a > 2, nor for x_n (n ≥ 1) except for the cases shown in Figure 6. The broken lines in these figures are determined by the terminal condition ν = μ.

The deformation at fracture shall be determined by assuming that the stress at x_min reaches the fracture stress σ_f. In the general polar coordinates, stress components σ_rr, σ_θθ, σ_rθ are given (see Appendix C) by


rr 


08 


2(l-v) 


Eh 


2(l-v 


2  2 
^  W  ^  V  ^  V  3  u? 

.2  r  dr  2._2 

dr  r  96 


1  M  +  iJ£ 

r  Sr  2  „.2  „  2 

r  30  3r 


(13.2) 


and 


r6 


Eh 


2(l+v) 


3_ 

dr 


(1  3z£\ 

[r  30/ 
where h is the thickness of the plate. In our case of axisymmetry,
introducing  the  nondimensional  length  x  defined  by  (l.^)»  the  above 
formulas  become 


Eh 


TV 


2(l-v)^^  dx 


_  d  U  V  ^ 

2  .  2  X  dx 


(13.3) 


-Eh 


ee 


2(l-v)2. 


,2 

diJ  a  It; 

OX 


and 


a  ^  =  0 

r0 


At point x_min, where dw/dx = 0, therefore,


^vt\  >  1^66 


Let W_0(x) be the normalization of w(x) at x = x_0. Then the depression is given by

\[ w(x) = K\,W_0(x) \tag{13.4} \]

where K shall be determined by applying the condition that

\[ |\sigma_{rr}| = \sigma_f \quad \text{at } x = x_{\min} \]

where σ_f is the fracture strength. Summing up the above results, K is found:

K  =  21^  o„  (Eh)"^  H(v,a) 

o  f 


(13.5) 


vhere 


H(v,a)  =  (l-v2)/(^ 

Idx 


(13.6) 


(13. T) 


X 


rmn 


Values of H(ν,a) are shown in Figure 9 for the case 1 - ν² ≤ a ≤ 2.
APPENDICES


In the following Appendices A, B, and C, transformations of tensor components utilized in this paper are derived by use of the tensor notation where tensors are expressed in combinations of components and base vectors. This tensor expression yields simpler and more enjoyable analysis of component transformations in Euclidean space than the conventional tensorial expressions where base vectors are omitted, because geometric and mechanical quantities are explicitly shown in the former and therefore the meaning of the step by step computation is clear.

In Appendix D, the deformation for the case a = ∞ is derived.

In Appendix E, the buckling of the semi-infinite plate is discussed. It is interesting to note that both cases pertain to the case of x_0 = ∞, but they are substantially different.

A. Transformation of (1.10) to (1.11).

Shears Q_x and Q_y in rectangular coordinates are the magnitude per unit length of the shears acting on a side normal to the x-axis and y-axis, respectively (see Figure 10). We shall begin with expressing Q_x and Q_y as components of a vector. Let c_x and c_y be unit vectors in the x- and y-directions. In Figure 11, let c_n be a unit vector normal to the hypotenuse AB. Vector c_n is given by

\[ c_n\,ds = c_x\,dy + c_y\,dx \tag{A.1} \]

because c_n thus defined satisfies the condition

\[ c_n \cdot c_x = dy/ds \]

and

\[ c_n \cdot c_y = dx/ds \]



where the dot (·) between two vectors means the scalar product of the two vectors. Let Q_n be the shear per unit length of the hypotenuse. It is given by

\[ Q_n\,ds = Q_x\,dy + Q_y\,dx \tag{A.2} \]

We can now prove that the equation

\[ Q = Q_x c_x + Q_y c_y \tag{A.3} \]

is the desired vector combination of Q_x and Q_y because the relation

\[ Q \cdot c_n = Q_n \]

is satisfied.

Substitute (1.10) into (A.3) and transform the result to a tensor-invariant form:


Q  ~  \7 .  ^ 


N 


where 


V  =  c  |-  +  c  — 

xdx  y  ’^y 


(A.k) 


(A.5) 


(A. 6) 


In (A.4), a convention is made that a·bc means (a·b)c.

Let u_r and u_θ be the unit vectors in the r- and θ-directions. They are given by

\[ u_r = c_x \cos\theta + c_y \sin\theta \]

and

\[ u_\theta = -c_x \sin\theta + c_y \cos\theta \tag{A.8} \]

These equations yield

\[ \frac{\partial u_r}{\partial\theta} = u_\theta, \qquad \frac{\partial u_\theta}{\partial\theta} = -u_r \tag{A.9} \]
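
The derivative relations (A.9) for the polar base vectors can be checked with a central difference. This is a minimal sketch that assumes the usual definitions u_r = (cos θ, sin θ) and u_θ = (−sin θ, cos θ) in the Cartesian basis; the function names and the step size are illustrative.

```python
import math


def u_r(theta):
    """Radial unit vector in Cartesian components."""
    return (math.cos(theta), math.sin(theta))


def u_theta(theta):
    """Circumferential unit vector in Cartesian components."""
    return (-math.sin(theta), math.cos(theta))


def derivative(vec_fn, theta, h=1e-6):
    """Central-difference derivative of a 2-vector function of theta."""
    plus, minus = vec_fn(theta + h), vec_fn(theta - h)
    return tuple((p - m) / (2.0 * h) for p, m in zip(plus, minus))


if __name__ == "__main__":
    theta = 0.7
    print(derivative(u_r, theta), u_theta(theta))
    print(derivative(u_theta, theta), tuple(-c for c in u_r(theta)))
```

These two relations are the only place where the curvilinear character of polar coordinates enters: every extra term that appears when the operator ∇ acts on a dyadic in polar form comes from differentiating the base vectors with respect to θ.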


In the polar coordinates, (A.5) becomes

\[ \nabla = u_r \frac{\partial}{\partial r} + u_\theta \frac{\partial}{r\,\partial\theta} \tag{A.10} \]


Q  = 


L  3  a 

U  "T —  +  U  ■■— '  — 

I  r  ar  e  rae 


-j  .M  + 


U  +  u  — 

p  ar  e  ae 


)  .N, 


where 


(a. 11) 


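\[ M = M_{rr} u_r u_r + M_{r\theta}(u_r u_\theta + u_\theta u_r) + M_{\theta\theta} u_\theta u_\theta \tag{A.12} \]

\[ N = N_{rr} u_r u_r + N_{r\theta}(u_r u_\theta + u_\theta u_r) + N_{\theta\theta} u_\theta u_\theta \tag{A.13} \]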


In the polar coordinates, (A.3) becomes

\[ Q = Q_r u_r + Q_\theta u_\theta \tag{A.14} \]

Carry out the differentiation in (A.11) by use of (A.9) and the scalar products indicated by dot (·), and identify the components with those of (A.14); then one finds (1.11).

B. Transformation of (1.1) to (1.2)

We shall prove the formula in the general polar coordinates,

2  2  P 

/i7  ^  ^  .  qA?  ,§ _ a.  77  9 


N  +  211  —  {—  — )+  N  (—  I  1  3zJ 

ar^  ^  (p2  3^2  r  ap 


(B.1)
The left-hand side of (B.1) is a tensor-invariant form N··∇∇w where N is given by (A.7), and ∇∇w is a dyadic.


77W 


(S;  8a;  *  By) 


dW  ■ 

+  C 

9a;  y 


The  double  dot  ( • • )  of  ab- -cd  means  (b.c) (a.d) .  In  polar  coordinates. 


77W 


/  3 

9  \ 

1  3 

3w  \ 

rdej 

K  3r 

*  rde) 

Carrying out the differentiation given by (A.9), one finds

ar'r  30' 


77tJ  =  ^ 

dr 


,1  j.  1  Sivii  11 
-2  ?  3r>  “e“e 

r  30 


(B.2) 


Carrying out the double dot products by use of N in (A.7) and ∇∇w in (B.2), one finds that N··∇∇w becomes the right-hand side of (B.1).

C. Proof of (13.2)

Substituting Equation (1.4) of Mansfield (1), one can transform the tensor equation


<y  =  OCC  +  acc  + 

X  a;  X  y  y  y  xy  x  y  y  x 


to a tensor-invariant form

O  =  -  [£;z/(i-v^)]  {yv-j  +4/iwj 

The  tensor-invariant  operator 


(C.l) 


■becomes 


„  9  1- 

^  rae 


{C.2) 


in  the  polar  coordinates. 

Substituting (A.10) and (C.2), and carrying out the differentiation as given by (A.9), one finds that (C.1) becomes
σ_rr, σ_rθ, σ_θθ are given by (13.2).

D. Deformation for a = ∞.

Let w_1(x) and w_2(x) be defined with the real and imaginary parts of the right-hand side of (10.6):


(x)  =  R  cosJ 


and 


W^{x)  =  B  sini 


Where 


R{x)  =  {ii/xf  exp 


and 


I(x)  =  ir/8  +  x//2  -  K^/(h/^) 
The  depression  is  given  by 

w(a^)  =  A  w^(x)  +  B  V  (x) 


When κ → ∞, there are two positive roots given by (12.16). The


ratio 


A 

L(w2) 

B 

■  L(w^)  ■  “ 

M(W^) 

can  be  c 

:omputed  by  using 

(12.14)  and  (12.15). 

A 

(l+c^)  cosJ^  + 

(l-C7z)  sinJ, 
h 

B 

(1+?;^)  sinJ^  - 

(1-C^)  cosJ^ 

where 

H 

II 

and  h  is 

defined  by  (l2.l8).  Normalizing  w{x) 

A  = 

(cosl^ 

1  (•^)  SinJ,) 

and 

ft  / 

B  = 

=  (slnJj 

-1  . 

+  COsJ^I 
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Thus  the  normalized  deformation  is  given  hy 


wJx)  =  B{x)/R.  ccsilix)  -  I 


)  +  (/3)  sin  (I(x)  -  I,  )  (D 


Letting 


I  <• 

and  assuming  ?  to  he  finite,  one  may  let  in  (D.l).  Thus  one  find: 


wJx)  =  exp(-5//2)  cobU/^)  +  {^)  sin(5/'^)] 

/v 

The  maximum  of  w^{x)  occurs  at 
tan(5/'^)  =  -  2  +  /3 

vhich  is  negative.  Therefore  w^{x)  is  always  decreasing  for  ?  >  0. 

The deformation at a = ∞, therefore, does not take a minimum, as those (shown in Figures 5 and 6) of the case 1 - ν² ≤ a ≤ 2 do.

E. Buckling of semi-infinite plate

We shall show that the deformation discovered in the preceding section is different from the buckling deformation of a rectangular semi-infinite floating plate.

We assume that uniform pressure N is applied on the axis y, the axis x extending from x = 0 to ∞. Then from (1.1) one gets


h  ^2 

d  w  ,,  <i  ^ 

-T  ^  ^  ^  ^xxT2 
dm  ar 


(E.l) 


where we have put q = 0. Defining new x by the quotient of old x divided by the characteristic length, (E.1) becomes

\[ \frac{d^4 w}{dx^4} + 2a \frac{d^2 w}{dx^2} + w = 0 \tag{E.2} \]


where we have pu'
When a ≥ 1, letting

\[ a = \cosh 2\eta \]

one finds four fundamental solutions

cos νx, sin νx, cos(x/ν), and sin(x/ν)

where

\[ \nu = \exp(\eta) \]

We shall discuss below only the case 0 < a < 1, because the boundary condition at x = ∞ cannot be satisfied in the other cases.

The free-edge condition for this case is


\[ \frac{d^2 w}{dx^2} = 0 \]

and

\[ \frac{d^3 w}{dx^3} + 2a \frac{dw}{dx} = 0 \tag{E.11} \]

The second equation of (E.11) is derived from the first equation of (1.10). Substituting (E.7), one finds that the eigenvalue is given by

\[ \eta = \pi/6, \]

i.e.

\[ a = 1/2 \tag{E.12} \]

The deformation for this case is

\[ w(x) = A \exp(-x/2) \cos\left[ (\sqrt{3}\,x/2) + (\pi/6) \right] \tag{E.13} \]

where A is arbitrary. The maximum of w(x) occurs at

\[ x = 4\pi/(3\sqrt{3}) \]

Therefore  the  deformation  in  this  section  is  different  from  the  deformation 
in  the  preceding  section. 
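
The eigenvalue a = 1/2 and the deformation (E.13) can be verified directly. The sketch below assumes the governing equation w'''' + 2a w'' + w = 0 with a = cos 2η for 0 < a < 1, and the free-edge conditions w'' = 0 and w''' + 2a w' = 0 at x = 0; the complex-exponent representation w(x) = Re[C exp(sx)] and the function names are illustrative choices, not the notation of the original.

```python
import cmath
import math


def decaying_mode(eta):
    """Exponent s and amplitude C of w(x) = Re[C exp(s x)] for a = cos(2*eta)."""
    s = 1j * cmath.exp(1j * eta)   # s = -sin(eta) + i*cos(eta), so Re s < 0
    c = cmath.exp(1j * eta)        # phase that satisfies the shear condition at x = 0
    return s, c


def edge_residuals(eta):
    """Values of w'' and w''' + 2a w' at x = 0 for the mode above."""
    a = math.cos(2.0 * eta)
    s, c = decaying_mode(eta)
    moment = (c * s ** 2).real
    shear = (c * (s ** 3 + 2.0 * a * s)).real
    return moment, shear


def slope(x, eta):
    """dw/dx of the decaying mode at position x."""
    s, c = decaying_mode(eta)
    return (c * s * cmath.exp(s * x)).real


if __name__ == "__main__":
    eta = math.pi / 6.0
    print("a =", math.cos(2.0 * eta))
    print("edge residuals:", edge_residuals(eta))
    print("slope at 4*pi/(3*sqrt(3)):", slope(4.0 * math.pi / (3.0 * math.sqrt(3.0)), eta))
```

With this choice of phase the shear condition holds for every η, and the moment condition reduces to cos 3η = 0, which gives η = π/6 and hence a = 1/2. The mode then equals exp(−x/2) cos(√3 x/2 + π/6), and its slope vanishes at x = 4π/(3√3).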

For  the  simple-edge  condition,  the  eigenvalue  is  given  by 
a_  -  1 

which  we  do  not  accept. 
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Nonlinear  Theory  of  the  Response  of 
Pavements  to  Vibratory  Loads 


Richard  A.  Weiss 

Pavement  Investigations  Division 
Soils  and  Pavements  Laboratory 
U.  S.  Army  Engineer  Waterways  Experiment  Station 
Vicksburg,  Mississippi 


ABSTRACT. A nonlinear model of the pavement response to a dynamic
load  is  presented  which  has  applications  to  the  vibratory  nondestructive 
method  of  testing  pavements.  The  parameters  of  the  model  have  been 
determined  by  comparison  with  actual  dynamic  load-deflection  curves.  The 
model  gives  a  quantitative  description  of  the  dependence  of  the  measured 
dynamic  load-deflection  curves  on  the  strength  of  the  pavement,  static 
load  of  the  vibrator,  and  the  frequency  of  operation  of  the  vibrator. 

The  model  determines  the  elastic  modulus  of  the  subgrade  from  the  measured 
load-deflection  curves.  The  nonlinear  dynamical  model  is  applied  to  the 
laboratory  determination  of  the  resilient  modulus  with  the  result  that 
the  resilient  modulus  is  expressed  analytically  in  terms  of  the  static 
confining  pressure,  dynamic  deviator  stress,  and  material  parameters  which 
describe  the  linear  and  nonlinear  behavior  of  soil  under  dynamic  and  static 
force  loading. 

I.  INTRODUCTION 

The Waterways Experiment Station (WES) has for many years used the method of nondestructive testing of airfield pavements. This method of testing pavements is relatively quick, accurate, reproducible, and inexpensive. When the nondestructive test method is used, an airfield runway need not be shut down for long periods of time, as is the case for the destructive testing of pavements.

The instrument used for the vibratory nondestructive testing of pavements
is a mechanical vibrator whose force payload to the pavement surface
is  generated  either  by  a  hydraulic  system  or  a  mechanism  of  counter-rotating 
weights.  The  WES  16-kip  vibrator  applies  a  static  load  of  16  kips  to  the 
pavement  surface  and  a  dynamic  load  to  the  pavement  surface  which  can  be 
varied  from  0  to  15  kips.  Both  static  and  dynamic  loads  are  applied  to 
the  pavement  surface  through  a  circular  18-in.  diameter  baseplate. 

Four types of nondestructive tests are generally performed on pavements,
and these consist of the following measurements:

a. Dynamic load-deflection curves giving the dynamic amplitude
as a function of the dynamic load.

b. Frequency response spectrum giving the dynamic amplitude
as a function of frequency for a fixed dynamic load.

c. Deflection basin measurements.

d. Rayleigh wave dispersion curves giving phase velocity
versus wavelength.

Only  the  dynamic  load-deflection  curves  and  the  frequency  response  spectrum 
measurements  will  be  considered  in  this  paper. 
A typical measured frequency response curve appears in Fig. 1, and
a  typical  measured  dynamic  load-deflection  curve  appears  in  Fig.  2.  Most 
of  the  WES  measurements  of  the  dynamic  load-deflection  curves  were  done  at 
a  frequency  of  15  Hz.  Experience  has  shown  that  the  dynamic  load-deflection 
curves  are  relatively  smooth  for  this  frequency.  The  frequency  response 
spectrum  may  contain  multiple  resonance  peaks. 

Two basic theoretical approaches have been taken to describe the
experimental data:

1.  a  linear  theory  of  the  frequency  response  spectrum 

2.  a  nonlinear  theory  of  the  dynamic  load-deflection  curves 

The two types of dynamic pavement response models that have been considered
are shown in Fig. 3. Single-mass and multiple-mass models have been
developed in the linear theory, while only a single-mass model was developed
with a nonlinear spring constant. It was found that multiple-mass pavement
response models are somewhat intractable because they contain many
parameters. Only the single-mass pavement response models are considered in
this paper. The elements of the spring-mass-dashpot model must be determined
in terms of the characteristic forms of the measured frequency response
spectrum and the measured dynamic load-deflection curves.

II.  DYNAMIC  FREQUENCY  RESPONSE  THEORY 

The dynamic frequency response spectrum measured at the pavement surface
is often quite complex and difficult to interpret. Many factors probably
contribute to produce its characteristic shape. In order to extract
some information about pavement and subgrade structure from the measured
dynamic frequency response spectrum it is necessary to use a simple
dynamic pavement response model to fit the measured frequency response
spectrum with the theoretically predicted frequency response spectrum. This
fit will yield the parameters of the dynamic model from which the pavement and
subgrade structure can be determined. The frequency response spectrum of
the single-mass model has one resonance peak, and this predicted resonance
peak is fit to the second resonance peak of the measured response spectrum.

The  second  peak  is  chosen  because  an  examination  of  many  frequency  response 
spectra  has  shown  this  peak  to  be  more  consistent  and  less  affected  by 
electronic  equipment  than  the  other  peaks.  Generally  the  second  peak  is 
the  most  pronounced. 

The second resonance peak is associated with a resonance frequency and
a resonance amplitude as indicated in Fig. 4. The resonance amplitude and
frequency were used to calculate the elements of the spring model: effective
mass, effective spring constant, and effective damping constant. The elements
of the single-mass spring model can be simply related to the resonance peak.

DETERMINATION OF ELEMENTS OF THE SPRING MODEL

Within the framework of the single-mass spring model, the dynamic
amplitude of the pavement surface response to a sinusoidal dynamic load
can be written as

$$A = F_D / S \qquad (1)$$

$$S = \sqrt{(k - m\omega^2)^2 + C^2\omega^2} \qquad (2)$$

where $A$ = amplitude of the dynamic displacement of the pavement surface as
represented by a linear spring model, $F_D$ = dynamic load applied to the
pavement surface, $S$ = dynamic stiffness, $k$ = linear spring constant, $m$ =
effective mass of the pavement-subgrade system, $\omega$ = angular frequency, and
$C$ = damping coefficient. The resonance frequency and amplitude can be
obtained from (1) and (2) to be

$$f_R = \frac{1}{2\pi}\sqrt{\frac{k}{m}}\sqrt{1 - 2D^2} \qquad (3)$$

$$A_R = \frac{F_D}{2kD\sqrt{1 - D^2}} \qquad (4)$$

$$D = \frac{C}{2\sqrt{km}} \qquad (5)$$

where $f_R$ = resonance frequency, $A_R$ = resonance amplitude, and $D$ =
damping ratio. The three elements of the linear spring model that are to be
obtained are $k$, $m$ and $C$. In order to determine these three parameters
another piece of information, in addition to $f_R$ and $A_R$, is necessary.

This is given by

$$J(\omega) = A_R / A \qquad (6)$$

where $J(\omega)$ = ratio of the resonance amplitude to the amplitude at some
nearby frequency. The theoretical value of this ratio is given by

$$J(k,m,C,\omega) = \sqrt{\frac{(k - m\omega^2)^2 + C^2\omega^2}{(k - m\omega_R^2)^2 + C^2\omega_R^2}} \qquad (7)$$

where $\omega_R = 2\pi f_R$.
The three measured quantities which are extracted from the frequency response
curve are $f_R$, $A_R$ and $J(\omega)$.
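As a minimal sketch of the linear single-mass model described above, the following evaluates the amplitude (1)-(2), the resonance quantities (3)-(5), and the ratio (6), and checks the closed-form resonance against a brute-force frequency scan. The numerical values of `k`, `m`, `C` and `F_D` are hypothetical and chosen only for illustration; they are not WES data.

```python
import math

def dynamic_stiffness(k, m, C, omega):
    """Dynamic stiffness S of Eq. (2)."""
    return math.sqrt((k - m * omega**2) ** 2 + (C * omega) ** 2)

def amplitude(F_D, k, m, C, omega):
    """Dynamic amplitude A = F_D / S of Eq. (1)."""
    return F_D / dynamic_stiffness(k, m, C, omega)

def resonance(F_D, k, m, C):
    """Resonance frequency (Hz), resonance amplitude and damping ratio,
    Eqs. (3)-(5). Valid only for D < 1/sqrt(2), where a peak exists."""
    D = C / (2.0 * math.sqrt(k * m))
    f_R = math.sqrt(k / m) * math.sqrt(1.0 - 2.0 * D**2) / (2.0 * math.pi)
    A_R = F_D / (2.0 * k * D * math.sqrt(1.0 - D**2))
    return f_R, A_R, D

def j_ratio(F_D, k, m, C, omega):
    """Ratio of the resonance amplitude to the amplitude at omega, Eq. (6)."""
    _, A_R, _ = resonance(F_D, k, m, C)
    return A_R / amplitude(F_D, k, m, C, omega)

# Hypothetical model elements in consistent units (illustration only).
k, m, C, F_D = 2.0e6, 50.0, 3000.0, 1000.0
f_R, A_R, D = resonance(F_D, k, m, C)

# Brute-force scan of the amplitude curve to locate the peak numerically.
omegas = [i * 0.01 for i in range(1, 40001)]
omega_peak = max(omegas, key=lambda w: amplitude(F_D, k, m, C, w))
```

The scan and the closed-form expressions should agree to within the scan step, which is a quick way to confirm that a set of fitted elements reproduces the measured peak.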

The spring model elements $k$, $m$ and $C$ must now be obtained in terms
of these measured quantities. The equations in (3) - (6) can be inverted to
determine $k$ and $D$ in the following manner


k  =  ATT^mf^ 
R 


(8) 


(9) 


The $k$ and $D$ terms have now been expressed in terms of the effective mass.
Using (8) and (9) it is now possible to express $J(k,m,C,\omega)$ in terms of the
effective mass as the only unknown parameter as follows


J(m,a)) 


(10) 


The only unknown independent variable in $J(m,\omega)$ is now the effective mass.
By sweeping through a series of values of $m$ and calculating numerical
values of $J(m,\omega)$ it is possible to determine the specific value of $m$ for
which $J(m,\omega)$ is equal to the experimental value of the J-ratio, i.e.,
$J(m,\omega) = J(\omega)$. This condition determines the value of the effective mass
required by the spring-mass-dashpot model to fit the experimentally
measured dynamic frequency response curve. Placing this calculated value of
the effective mass into (5), (8) and (9) gives the proper values of $k$ and
$C$ required to fit the experimental frequency response data. The necessary
computer programs to accomplish this work on a digital computer have been
developed and will be referred to as the WES Dynamic Frequency Response
Program.
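The mass search described above can be sketched as follows. This is not the WES Dynamic Frequency Response Program. Because the printed forms of (8)-(10) are not legible here, the sketch uses an inversion worked out directly from the standard resonance relations for a damped single-mass oscillator (resonance frequency, resonance amplitude and damping ratio): writing a = 4π²·m·f_R² = k(1 − 2D²) and b = F_D/A_R = 2kD·sqrt(1 − D²), one finds a² + b² = k². That algebra, the bisection (used in place of a plain sweep), and all numerical values are assumptions made for illustration.

```python
import math

def k_and_D(m, f_R, A_R, F_D):
    """Spring constant and damping ratio for a trial mass m (assumed inversion)."""
    a = m * (2.0 * math.pi * f_R) ** 2   # a = k (1 - 2 D^2), resonance frequency
    b = F_D / A_R                        # b = 2 k D sqrt(1 - D^2), resonance amplitude
    k = math.hypot(a, b)                 # because a^2 + b^2 = k^2
    D = math.sqrt(0.5 * (1.0 - a / k))
    return k, D

def j_model(m, omega, f_R, A_R, F_D):
    """Theoretical J-ratio for a trial mass m: stiffness at omega over stiffness at resonance."""
    k, D = k_and_D(m, f_R, A_R, F_D)
    C = 2.0 * D * math.sqrt(k * m)       # damping from D = C / (2 sqrt(k m))
    w_R = 2.0 * math.pi * f_R
    num = (k - m * omega**2) ** 2 + (C * omega) ** 2
    den = (k - m * w_R**2) ** 2 + (C * w_R) ** 2
    return math.sqrt(num / den)

def fit_mass(J_meas, omega, f_R, A_R, F_D, m_lo=1e-6, m_hi=1e6):
    """Find m such that j_model(m) = J_meas by bisection.
    Relies on j_model increasing with m, which holds for this inversion."""
    for _ in range(200):
        m_mid = 0.5 * (m_lo + m_hi)
        if j_model(m_mid, omega, f_R, A_R, F_D) < J_meas:
            m_lo = m_mid
        else:
            m_hi = m_mid
    m = 0.5 * (m_lo + m_hi)
    k, D = k_and_D(m, f_R, A_R, F_D)
    return m, k, 2.0 * D * math.sqrt(k * m)

# Synthetic "measurement" generated from hypothetical true elements.
k_true, m_true, C_true, F_D = 2.0e6, 50.0, 3000.0, 1000.0
D_true = C_true / (2.0 * math.sqrt(k_true * m_true))
f_R = math.sqrt(k_true / m_true) * math.sqrt(1.0 - 2.0 * D_true**2) / (2.0 * math.pi)
A_R = F_D / (2.0 * k_true * D_true * math.sqrt(1.0 - D_true**2))
omega = 1.2 * 2.0 * math.pi * f_R        # a nearby frequency
S_omega = math.sqrt((k_true - m_true * omega**2) ** 2 + (C_true * omega) ** 2)
J_meas = A_R * S_omega / F_D             # ratio of resonance amplitude to amplitude at omega

m_fit, k_fit, C_fit = fit_mass(J_meas, omega, f_R, A_R, F_D)
```

With this inversion the J-ratio reduces to J² = 1 + m²(ω² − ω_R²)²/b², so the search is monotonic in m and the bisection cannot stall on a secondary root.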

DETERMINATION  OF  SUBGRADE  MODULUS  BY  FREQUENCY  RESPONSE  METHOD 

The value of the spring constant that is determined from the measured
frequency response spectrum will be used to determine the subgrade modulus.
The theory of the linear elastic layered half-space predicts a theoretical
value of the static spring constant $k_s$ which depends on the radius of
the loaded area and on the elastic constants of the subgrade and the
pavement layers. Computer programs are available which calculate the value
of $k_s$ if the Young's modulus and Poisson's ratio of each layer of the
half-space are known. A well known computer program of this kind is the
Chevron Program. The procedure for determining the Young's modulus $E_s$
of the subgrade is shown in Fig. 5. The measured values of $f_R$, $A_R$ and
$J(\omega)$ are inserted into the WES Dynamic Frequency Response Program and
values of $k$, $m$ and $C$ are determined. The Young's modulus and Poisson's
ratio of the layers of the pavement are selected and entered into the
Chevron Program. The subgrade modulus $E_s$ is then iterated in the Chevron
Program and a series of values of $k_s$ are determined. The proper value of
$E_s$ is determined by the condition

$$k_s = k \qquad (11)$$

The predicted value of $E_s$ will depend on the values of the elastic moduli
selected for the pavement layers.
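The iteration on the subgrade modulus can be sketched as a one-dimensional root search. The Chevron Program is not reproduced here; as a loudly labeled stand-in, the sketch uses the classical stiffness of a rigid circular plate on a homogeneous elastic half-space, k = 2·E·a/(1 − ν²), which ignores the pavement layers entirely. The function names, Poisson's ratio, and measured spring constant are hypothetical.

```python
def k_static_rigid_plate(E_s, nu, radius):
    """STAND-IN for a layered elastic program: static stiffness of a rigid
    circular plate of given radius on a homogeneous half-space."""
    return 2.0 * E_s * radius / (1.0 - nu**2)

def solve_subgrade_modulus(k_measured, static_stiffness, E_lo, E_hi, rel_tol=1e-12):
    """Iterate E_s by bisection until static_stiffness(E_s) matches k_measured.
    Assumes the static stiffness increases monotonically with E_s."""
    for _ in range(200):
        E_mid = 0.5 * (E_lo + E_hi)
        if static_stiffness(E_mid) < k_measured:
            E_lo = E_mid
        else:
            E_hi = E_mid
        if E_hi - E_lo <= rel_tol * E_hi:
            break
    return 0.5 * (E_lo + E_hi)

radius = 9.0          # in., half of the 18-in. baseplate diameter
nu = 0.4              # hypothetical Poisson's ratio
k_measured = 1.0e6    # lb/in., hypothetical spring constant from the frequency fit

E_s = solve_subgrade_modulus(
    k_measured,
    lambda E: k_static_rigid_plate(E, nu, radius),
    E_lo=1.0e2,
    E_hi=1.0e7,
)
```

Passing the stiffness routine in as a function keeps the search independent of the elastic model, so a layered program could be substituted for the stand-in without changing the iteration.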

NUMERICAL  RESULTS  OF  FREQUENCY  RESPONSE  METHOD 

Values of $k$, $m$, $C$ and $E_s$ have been obtained for several airport
pavement sites and are listed in Table I. This table lists the sites
according to increasing values of the Dynamic Stiffness Modulus (DSM), which
is the slope of the dynamic load-deflection curves at a dynamic load of 14
kips. It is seen that the measured spring constant $k$ increases with
increasing pavement strength and that $k$ is not equal to the DSM value. The
effective mass is presented as a ratio to the above-surface (vibrator) mass,
and increases with the strength of the pavement. The effective mass is not
equal to the above-surface mass, and any theory which a priori assumes that
$m$ equals the above-surface mass cannot be used to fit the experimental
frequency response data.

The value of the damping constant also increases with increasing pavement
strength. The predicted values of $E_s$ are compared to those modulus values
that are predicted by the CBR method ($E_s$ = 1500 CBR). The values of $E_s$
predicted by the combined WES Frequency Response Program and the Chevron
Program are 3 to 5 times larger than those predicted by the CBR method.

There are several possible reasons for the discrepancy in the values
of $E_s$ predicted by these two methods:

a. the pavement-subgrade system is nonlinear under dynamic and
static loading

b. the subgrade is not uniform and the theoretical layered
elastic half-space model requires a rigid boundary below
the subgrade

c. reflections from a lower boundary layer add to the motion
of the pavement surface

When a rigid boundary such as bedrock is present relatively close to the
pavement surface it is possible that the effects listed in b and c may be
of importance for determining the motion of a pavement surface that is
subjected to a sinusoidal dynamic loading. However, the discrepancy between
the values of $E_s$ predicted by the CBR method and that predicted by the
frequency response spectra method also occurs in cases where the subgrade
is relatively uniform and contains no obvious discontinuities. Therefore
only the fact that the response of pavements and subgrades to dynamic and
static loads is nonlinear remains as a possible explanation for the
discrepancy in the values of $E_s$ determined by these two methods.

III. NONLINEAR THEORY OF PAVEMENT RESPONSE TO DYNAMIC SURFACE LOADINGS

An alternative method for determining the subgrade modulus from
vibratory  nondestructive  test  data  is  the  use  of  the  dynamic  load-deflection 
curves  measured  at  the  pavement  surface  for  a  fixed  frequency  and  a  fixed 
static  load.  These  dynamic  load-deflection  curves  are  generally  nonlinear 
for  weak  pavements  and  become  more  linear  for  stronger  pavements.  Over  the 
years  the  WES  has  collected  an  extensive  set  of  dynamic  load-deflection  curves 
that  have  been  obtained  on  many  airfield  pavements  throughout  the  country. 

The nonlinear dynamic load-deflection curves were measured at a frequency
of 15 Hz and at a static surface loading of 16 kips. The nonlinear
dynamic theory must account for the frequency and static load conditions
under which the dynamic load-deflection curves were measured. The
predicted subgrade modulus should be free of the particular loading
characteristics of the vibrator. Therefore, in addition to the static Young's
modulus some other parameters have to be introduced which will account
for the observed nonlinearity of the dynamic load-deflection curves.
These nonlinear parameters must also account for the nonlinear behavior of
the static load-deflection curves. The predicted subgrade modulus value
will be independent of the particular loading characteristics of the
vibrator (frequency, static load, and dynamic load). Only the natural
overburden pressure will be reflected in the subgrade modulus value.

The determination of the elastic constants and the static and dynamic
parameters of the pavements and subgrades from measured dynamic
load-deflection data requires a nonlinear dynamic theory of pavement
response⁴.

EQUATION  OF  MOTION  OF  A  NONLINEAR  OSCILLATOR 

The nonlinear theory of pavement response to a vibratory load assumes
that the pavement-subgrade system can be described by a lumped mass
nonlinear oscillator whose equation of motion is written as

$$m\ddot{x} + C\dot{x} + k_{00}x + bx^3 + ex^5 = F_D + F_S \qquad (12)$$

where $m$ = effective mass of the pavement-subgrade system, $x$ = total
displacement of the pavement surface beneath the vibrator baseplate, $C$ =
damping constant, $k_{00}$ = linear spring constant, $b$ = third order
nonlinear pavement parameter, $e$ = fifth order nonlinear pavement parameter,
$F_D$ = dynamic load applied to the pavement surface, and $F_S$ = static load
applied to the pavement surface. The total displacement of the pavement
surface is decomposed into a static and a dynamic part as follows


$$x = x_e + \xi \qquad (13)$$

where $x_e$ = static elastic displacement of the pavement surface, and $\xi$ =
dynamic elastic displacement of the pavement surface. Placing (13) into (12)
gives the following equation of motion

$$m\ddot{\xi} + C\dot{\xi} + (k_{00} + 3bx_e^2 + 5ex_e^4)\xi + b\xi^3 + e\xi^5 + \xi g(x_e,\xi) = F_D \qquad (14)$$

where

$$g(x_e,\xi) = 3bx_e\xi + 10ex_e^3\xi + 10ex_e^2\xi^2 + 5ex_e\xi^3 \qquad (15)$$

For convenience in manipulating (14) it is necessary to use a time
averaged expression for (15)

$$g(x_e,\xi) = 3a_1bx_e^2 + 5a_2ex_e^4 + a_3b\xi^2 + a_4e\xi^4 \qquad (16)$$


where $a_1$, $a_2$, $a_3$ and $a_4$ are coefficients to be determined from the
measured dynamic load-deflection data. Combining (16) and (14) gives the motion
equation as

$$m\ddot{\xi} + C\dot{\xi} + k_0\xi + b\theta\xi^3 + e\eta\xi^5 = F_D \qquad (17)$$

where

$$k_0 = k_{00} + 3b\epsilon_2x_e^2 + 5e\epsilon_4x_e^4 \qquad (18)$$

$$\theta = 1 + a_3 \qquad (19)$$

$$\eta = 1 + a_4 \qquad (20)$$

$$\epsilon_2 = 1 + a_1 \qquad (21)$$

$$\epsilon_4 = 1 + a_2 \qquad (22)$$

The parameters $\theta$, $\eta$, $\epsilon_2$ and $\epsilon_4$ depend on the pavement strength and are
determined by requiring (17) to adequately describe the dynamic load-deflection
curves. The nonlinear parameters $b$ and $e$ determine the static
load-deflection curves, as can be seen from (12)


$$F_S = k_{00}x_e + bx_e^3 + ex_e^5 \qquad (23)$$

In general it is found that $b < 0$ and $e > 0$ for pavements and most
subgrades.
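The decomposition of the nonlinear restoring force into static and dynamic parts can be checked numerically. The sketch below solves the static relation (23) for the static displacement by bisection and then confirms that the restoring force at x_e + ξ, minus its static part, equals the dynamic spring terms of (14) with g taken from (15). The parameter values are hypothetical, dimensionless, and chosen with b < 0 and e > 0 such that the static curve stays monotonic; the symbol names follow the reading of the equations adopted in this text.

```python
def restoring_force(x, k00, b, e):
    """Nonlinear spring force k00*x + b*x^3 + e*x^5 from Eq. (12)."""
    return k00 * x + b * x**3 + e * x**5

def static_displacement(F_S, k00, b, e, x_hi=10.0):
    """Solve Eq. (23) for x_e by bisection on [0, x_hi].
    Assumes the static load-deflection curve is monotonic on that interval."""
    x_lo = 0.0
    for _ in range(200):
        x_mid = 0.5 * (x_lo + x_hi)
        if restoring_force(x_mid, k00, b, e) < F_S:
            x_lo = x_mid
        else:
            x_hi = x_mid
    return 0.5 * (x_lo + x_hi)

def g(x_e, xi, b, e):
    """Cross terms of Eq. (15)."""
    return (3.0 * b * x_e * xi + 10.0 * e * x_e**3 * xi
            + 10.0 * e * x_e**2 * xi**2 + 5.0 * e * x_e * xi**3)

def dynamic_spring_terms(x_e, xi, k00, b, e):
    """Displacement-dependent terms on the left side of Eq. (14)."""
    k_lin = k00 + 3.0 * b * x_e**2 + 5.0 * e * x_e**4
    return k_lin * xi + b * xi**3 + e * xi**5 + xi * g(x_e, xi, b, e)

# Hypothetical dimensionless parameters with b < 0 and e > 0.
k00, b, e = 1.0, -0.5, 0.2
F_S = 0.8
x_e = static_displacement(F_S, k00, b, e)

xi = 0.3
lhs = restoring_force(x_e + xi, k00, b, e) - restoring_force(x_e, k00, b, e)
rhs = dynamic_spring_terms(x_e, xi, k00, b, e)
```

Agreement of `lhs` and `rhs` to rounding error is just the binomial expansion of the cubic and quintic terms, which is why the static load drops out of the dynamic equation.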

THEORY OF DYNAMIC LOAD-DEFLECTION CURVES

The problem remains to solve the nonlinear equation (17). This can
be done by casting (17) into an equivalent linear form for which the dynamic
amplitude is given by

$$\xi = F_D / S \qquad (24)$$

where

$$S = \sqrt{(k - m\omega^2)^2 + C^2\omega^2} \qquad (25)$$

where $S$ = dynamic stiffness, $k$ = dynamic spring constant, $m$ = effective
mass, $\omega$ = angular frequency and $C$ = damping constant. The requirement that
(24) and (25) be a solution of (17) is that the spring constant in (25) is
given by⁴

$$k = k_0 + \tfrac{3}{4}b\theta\xi^2 + \tfrac{5}{8}e\eta\xi^4 \qquad (26)$$

Therefore the spring constant for a nonlinear system depends on the dynamic
and static displacements of the pavement surface.
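Because the dynamic spring constant itself depends on the dynamic amplitude, the amplitude must be found self-consistently. The following is a minimal sketch of that step, assuming the usual equivalent-linearization coefficients 3/4 and 5/8 for the cubic and quintic terms (the printed fractions are not fully legible, so these are an assumption). It solves ξ·S(ξ) = F_D by bisection for a series of dynamic loads to trace a load-deflection curve. All parameter values are hypothetical and dimensionless.

```python
import math

def spring_constant(xi, k0, b, theta, e, eta):
    """Amplitude-dependent spring constant (assumed coefficients 3/4 and 5/8)."""
    return k0 + 0.75 * b * theta * xi**2 + 0.625 * e * eta * xi**4

def stiffness(xi, k0, b, theta, e, eta, m, C, omega):
    """Dynamic stiffness with the amplitude-dependent spring constant."""
    k = spring_constant(xi, k0, b, theta, e, eta)
    return math.sqrt((k - m * omega**2) ** 2 + (C * omega) ** 2)

def dynamic_amplitude(F_D, params, xi_hi=1.0):
    """Solve xi * S(xi) = F_D by bisection; returns one root above zero."""
    def residual(xi):
        return xi * stiffness(xi, **params) - F_D

    while residual(xi_hi) < 0.0:     # expand the bracket until it contains a root
        xi_hi *= 2.0
    xi_lo = 0.0
    for _ in range(200):
        xi_mid = 0.5 * (xi_lo + xi_hi)
        if residual(xi_mid) < 0.0:
            xi_lo = xi_mid
        else:
            xi_hi = xi_mid
    return 0.5 * (xi_lo + xi_hi)

# Hypothetical dimensionless parameters with b < 0 and e > 0.
params = dict(k0=1.0, b=-0.5, theta=1.0, e=0.2, eta=1.0, m=0.1, C=0.2, omega=1.0)
loads = [0.1, 0.2, 0.3, 0.4, 0.5]
curve = [(F, dynamic_amplitude(F, params)) for F in loads]
```

With b < 0 the spring softens as the amplitude grows, so the computed deflections rise faster than the load, which is the curvature seen in load-deflection curves of weak pavements.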

Placing (26) into (25) and (24), and solving for the dynamic amplitude
yields the result⁴


As seen from (27) - (30) the degree of nonlinearity of a dynamic
load-deflection curve depends on the strength of the pavement and the frequency
of operation of the vibrator. The strength of the pavement affects the
degree of nonlinearity of the dynamic load-deflection curves through the
term $S_0^{-4}$ that appears in (27) and (29). The $S_0^{-4}$ term shows that strong
pavements tend to be more linear than weak pavements. From (30) it is clear
that there is a critical frequency for which the first order nonlinear term
vanishes and this frequency is given by

(32)


At this frequency the dynamic load-deflection curves should become especially
linear in the regions of low dynamic force if the second order nonlinear term
is comparatively small. The straightening effect at the critical frequency
will not be strongly evident if the second order nonlinear term is
comparatively large.

DYNAMIC NATURE OF THE SPRING CONSTANT

The measurement of the dynamic load-deflection curves determines the
linear and nonlinear parameters of a pavement system: $k_{00}$, $b$, $e$, $\theta$,
$\eta$, $\epsilon_2$ and $\epsilon_4$. Equation (26) shows that the spring constant $k$ that is
determined from a dynamic analysis of the nonlinear properties of a
pavement-subgrade system is dependent on the dynamic and static displacements
of the pavement surface as well as on the elastic constants of the
pavement-subgrade system. Therefore the spring constant $k$ that is determined
from the dynamic response of a nonlinear pavement system is a dynamic
quantity that is not analogous to an ordinary static spring constant. The
theoretical static spring constant determined from a static linear elastic
program such as the Chevron Program will depend only on the elastic constants
of the pavement. Therefore the value of $k$ determined from the dynamic
response data of a nonlinear pavement cannot logically be compared to the
static $k_s$ value determined from static layered elastic computer programs.
Static plate bearing tests will result in a spring constant which will also
not be directly comparable to the spring constant determined from an analysis
of dynamic data.

FINITE  DEPTH  OF  INFLUENCE 


The static linear and nonlinear parameters, $k_{00}$ and $b$ and $e$
respectively, can be related to the elastic parameters of the pavement layers
and to the depth of influence of the static stress-strain field⁴. The finite
depth of influence is written in terms of the static deflection of the
pavement surface as

^  =  y 0  +  +  /x**  (33) 

u  2  e  4  e 

/ 

For  the  simplest  case  of  a  vibrator  placed  on  the  surface  of  a  subgrade, 
the  static  parameters  are 


=  27ra^ii;(l  -  v)G 
00  "  /qCI  -  2v) 

4ira^i|)y(l  -  v)G 

b  - - ^ - 

£-^{X  -  2v) 


(34) 


(35) 


6Tra^i|;6(l  -  v)G  /3g) 

y^d  -  2v) 


where 


6  = 


(37) 


and $\psi$ = volume factor for the frustum of the cone of stress and strain. It
is through equations similar to (34) - (37) that the connection is made
between the elastic parameters of the pavement system and the theoretical
expression for the dynamic stiffness as given by (25) and (26).

MODEL  PARAMETERS 

The  model  parameters  k  ,  m  ,  C  ,  ,  b  ,  e  ,  ij,  ,  ,  9  , 

n  ,  Ej  and  depend  on  vibrator  characteristics  and  on  the  structure  of 

the pavement and subgrade. This dependence is in general very complicated
and difficult to determine theoretically. The simplest way to attach the
model parameters to the strength of a pavement-subgrade system is to
determine these parameters in terms of the measured dynamic stiffness modulus
(DSM) of a pavement. The DSM is the slope of the load-deflection curve
measured by the WES 16-kip vibrator in the region of large dynamic load;
it is in fact the tangent modulus of the dynamic load-deflection curves for
$F_D$ = 14 kips. The DSM value is a suitable choice for a parameter in terms of
which to describe the model parameters because it is a measure of the bulk
of the pavement and subgrade. The model parameters expressed in
terms of the measured DSM correspond to the WES 16-kip vibrator. The
vibrator characteristics appear in these parameters because the subgrade
modulus to be determined is intended to be independent of the dynamic
characteristics of the vibrator. A corresponding set of vibrator parameters
will have to be developed for any other vibrator that is to be used for
nondestructive testing of pavements.
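Since the DSM is defined as the tangent slope of the measured load-deflection curve at a large dynamic load, it can be estimated from tabulated points by a finite difference. The sketch below takes the slope of the segment that brackets the target load. The data points and their units (kips and mils) are hypothetical and serve only to illustrate the calculation; a real curve would normally be smoothed or fitted before differentiating.

```python
def tangent_slope(deflections, loads, load_target):
    """Slope dF/d(deflection) of a tabulated load-deflection curve at
    load_target, taken from the pair of points that brackets that load.
    Assumes loads are listed in increasing order."""
    pairs = list(zip(deflections, loads))
    for (x0, f0), (x1, f1) in zip(pairs, pairs[1:]):
        if f0 <= load_target <= f1:
            return (f1 - f0) / (x1 - x0)
    raise ValueError("load_target lies outside the tabulated range")

# Hypothetical measured curve: deflection in mils, dynamic load in kips.
deflections = [0.0, 2.0, 4.0, 6.0, 8.0, 10.0]
loads = [0.0, 4.0, 7.5, 10.5, 13.0, 15.0]

dsm = tangent_slope(deflections, loads, load_target=14.0)   # kips per mil
initial_slope = tangent_slope(deflections, loads, load_target=1.0)
```

For this made-up curve the slope near 14 kips is lower than the initial slope, the kind of difference between tangent and initial stiffness that makes the choice of evaluation load matter.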

The  model  parameters  are  presented  as  a  function  of  the  measured  DSM  in 
Figs.  6  through  15.  From  these  figures  it  is  seen  that  k  ,  m  ,  C  and  ^ 
are  increasing  functions  of  the  strength  of  the  pavement.  The  dynamic  spring 
constant  presented  in  Fig.  6  corresponds  to  a  dynamic  load  of  15  kips.  The 
depth  of  influence  of  the  static  stress-strain  field  increases  with  increasing 
pavement strength while the static deflection of the pavement surface
under a fixed static load decreases with increasing strength. As seen
from Fig. 7 the effective mass is generally much larger than the
above-surface mass, and it would be incorrect to assume that the only
lumped-mass of the vibrator-pavement-subgrade system is the vibrator mass itself.
The effective mass of the dynamic model includes the inertial effects of
the mechanical radiation field in the pavement and subgrade. In all cases
of the pavements investigated it was found that $b < 0$ and $e > 0$.

DETERMINATION OF SUBGRADE MODULUS FROM DYNAMIC LOAD-DEFLECTION CURVES

The nonlinear dynamic response model that has been outlined in the
preceding section can be used in conjunction with a dynamic load-deflection
curve measured at the pavement surface to determine the modulus of the
subgrade beneath the pavement. A computer program has been developed which
calculates the theoretical dynamic response of a pavement in terms of the
elastic moduli of the pavement layers and subgrade and in terms of the
empirically determined parameters $\theta$, $\eta$, $\epsilon_2$, $\epsilon_4$, $m$ and $C$ which have
been expressed in terms of the measured DSM values of the pavement. A
typical example of the vibratory nondestructive input data to the computer
program is shown in Table II. The computer program calculates a theoretical
load-deflection curve in terms of the $b$ and $e$ coefficients that are
determined from measured load-deflection curves and in terms of the elastic
moduli of the pavement layers and the subgrade. The elastic moduli of the
pavement layers are selected from laboratory tests and CBR measurements.

The subgrade modulus is then determined by requiring that the theoretically
predicted dynamic load-deflection curve agree with the measured dynamic
load-deflection curve. This procedure for determining the subgrade modulus is
shown in Fig. 16. The numerical results of this procedure for a few pavement
sites are presented in Table III. The values of the subgrade modulus
predicted by the nonlinear dynamic response theory are in general agreement with
those predicted by the empirical relation $E_s$ = 1500 CBR.

IV. LABORATORY CONFIRMATION OF VIBRATORY NONDESTRUCTIVE FIELD TEST DATA

It is of interest to be able to correlate the laboratory value of the
resilient modulus $M_r$ of a soil sample taken from the subgrade at a pavement
or soil site with the subgrade modulus predicted for that site by the vibratory
nondestructive testing method. Such a correlation is difficult to achieve
because the loading conditions on the soil sample for the laboratory tests are
different from the loading conditions on the subgrade during vibratory
nondestructive testing. The loading conditions differ in terms of the magnitude
of the static and dynamic stresses and in terms of the frequency of application
of the dynamic stress.

In  its  natural  state,  an  element  of  soil  in  the  subgrade  is  subjected 
only  to  the  overburden  pressure.  When  a  vibrator  is  operated  on  the  surface 
of  a  pavement  or  subgrade,  an  additional  static  and  dynamic  stress  is  applied 
to  an  element  of  soil  in  the  subgrade.  For  the  WES  16-kip  vibrator  the  static 
load  applied  to  the  surface  is  16  kips,  while  the  dynamic  load  can  be  varied 
up  to  15  kips  and  is  applied  sinusoidally  with  a  frequency  of  15  Hz.  The 
stress  field  in  the  subgrade  is  nonuniform  and  can  be  calculated  by  standard 
elasticity  theory. 

The laboratory sample for resilient modulus testing is cylindrical in
shape with a typical diameter of 3 inches and a length of 6 inches. The
cylindrical sample is subjected to a static confining pressure and then a
dynamic load is applied in the axial direction. The stress is uniform along the
axis of the laboratory sample. The total stress along the axis of the
laboratory sample is written as

$$\sigma = \sigma_D + \sigma_S \qquad (38)$$

where $\sigma_D$ = dynamic stress in axial direction of sample, and $\sigma_S$ = confining
pressure. The axial dynamic stress is also called the deviator stress and
is written as $\sigma_D = \sigma - \sigma_S$, where $\sigma$ = total stress along the axis of the
specimen. The resilient modulus has been measured for a number of soil
and pavement materials, and has been found to depend on $\sigma_S$ and $\sigma_D$.
The dependence of $M_r$ on the dynamic deviator stress is such that $M_r$ at
first decreases with increasing values of $\sigma_D$, attains a minimum value, and
then increases with further increase of the deviator stress.

The  dynamic  stress  acting  along  the  axial  direction  of  the  soil  specimen 
during  the  laboratory  resilient  modulus  test  is  applied  as  a  series  of  pulses 
in  the  form  of  haversines  with  a  pulse  of  1  second  duration  being  applied 
every  3  seconds.  The  characteristic  frequency  of  the  dynamic  loading  on  the 
sample  will  therefore  be  in  the  range  of  0.3  -  1.0  Hz,  and  this  is  much  lower 
than  the  frequency  of  15  Hz  at  which  the  vibratory  nondestructive  field  tests 
are  conducted.  The  large  difference  in  the  frequencies  used  for  these  two 
types  of  tests  requires  that  an  adequate  account  of  frequency  effects  be 
included  in  the  theoretical  analysis  of  both  laboratory  and  field  vibratory 
tests. 
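The laboratory loading history described above can be written down directly. The sketch below generates a haversine pulse of 1 second duration repeated every 3 seconds and records the two characteristic frequencies that bound the quoted 0.3 - 1.0 Hz range: the repetition rate of the pulses and the reciprocal of the pulse duration. The function name and the unit peak load are illustrative choices.

```python
import math

def haversine_pulse_train(t, pulse_duration=1.0, period=3.0, peak=1.0):
    """Dynamic load at time t (seconds): a haversine pulse sin^2(pi*tau/T)
    of length pulse_duration, repeated once every period, zero in between."""
    tau = t % period
    if tau >= pulse_duration:
        return 0.0
    return peak * math.sin(math.pi * tau / pulse_duration) ** 2

repetition_frequency = 1.0 / 3.0   # Hz, one pulse every 3 seconds
pulse_frequency = 1.0 / 1.0        # Hz, reciprocal of the 1-second pulse length
field_frequency = 15.0             # Hz, vibratory nondestructive field tests

# Sample one full loading cycle at 0.1 s spacing.
samples = [haversine_pulse_train(0.1 * i) for i in range(31)]
```

The field test frequency is 15 to 45 times higher than either laboratory frequency, which is the gap that the frequency-dependent terms of the dynamical model are meant to bridge.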

NONLINEAR  DYNAMICAL  ANALYSIS  OF  THE  RESILIENT  MODULUS  TEST 

A dynamical theory of the resilient modulus test has been developed which
is similar in form to the analysis developed for the vibratory nondestructive
field tests. The basic result of this theory is that the dynamic displacement
of the test specimen can be written as

$$\xi = F_D / S = A_C\sigma_D / S \qquad (39)$$

$$S = \sqrt{(k - m\omega^2)^2 + C^2\omega^2} \qquad (40)$$

where $\xi$, $F_D$ and $\sigma_D$ = resilient dynamic displacement, dynamic load, and
dynamic stress on the cylinder end in the axial direction; $S$, $k$, $m$, $C$
and $A_C$ = dynamic stiffness, spring constant, effective mass, damping
constant, and area of loaded end of the cylinder respectively; $\omega$ = effective
angular frequency component of the dynamic load applied to the soil sample.

The nonlinear theory of vibrations that was outlined earlier in this
paper for the vibratory nondestructive field tests can also be used to calculate
the quantities in (39) and (40). This nonlinear theory shows that the spring
constant is given by

$$k = k_0 + \tfrac{3}{4}b\theta\xi^2 + \tfrac{5}{8}e\eta\xi^4 \qquad (41)$$

$$k_0 = k_{00} + 3b\epsilon_2x_e^2 + 5e\epsilon_4x_e^4 \qquad (42)$$

where $b$, $e$, $\theta$, $\eta$, $\epsilon_2$ and $\epsilon_4$ = parameters which characterize the soil
sample, and $x_e$ = resilient static displacement of the soil sample in the
axial direction. The coefficients $k_{00}$, $b$ and $e$ could be determined from
the resilient static stress-strain curve if such a curve could be measured.

The resilient static stress-strain curve of the soil sample is determined by

$$F_S = A_C\sigma_S = k_{00}x_e + bx_e^3 + ex_e^5 \qquad (43)$$

where $\sigma_S$ = static confining pressure, and $F_S$ = total static force applied
to the cylinder end.

The solution of (39) - (42) can be written as⁴

(44)


5 


1 


where 


$$S_0 = \sqrt{(k_0 - m\omega^2)^2 + C^2\omega^2} \qquad (45)$$

$$\psi = F_D^2 / S_0^4 = A_C^2\sigma_D^2 / S_0^4 \qquad (46)$$


(47) 


a 


2 


(48) 


The  dynamic  stiffness  of  the  soil  sample  can  be  obtained  from  (39)  and  (44) 
to  be 


s  =  Sjjd  + 


(49) 


(50) 


(51) 


The  quantities  necessary  for  the  calculation  of  the  resilient  modulus  have 
now  been  determined. 

CALCULATION  OF  THE  RESILIENT  MODULUS 

The resilient modulus is defined as the slope of the unloading portion of
the dynamic stress-strain curve of the soil sample, and is given by

$$M_r = \frac{d\sigma_D}{d\epsilon_D} = \frac{L}{A_C}\frac{dF_D}{d\xi} \qquad (52)$$

where $\epsilon_D$ = dynamic strain in axial direction, $L$ = length of the soil sample,
and $A_C$ = area of end of the cylindrical sample. In (52) $\xi$ is assumed to
describe the unloading portion of the resilient dynamic load-deflection curve
of the soil sample. Combining (44) and (52) gives


V') 


(53) 


where 


(54) 


(55) 


(56) 


For  the  low  frequency  and  small  mass  with  which  the  resilient  modulus  tests 
are  conducted,  the  inertial  and  damping  terms  in  (40)  and  (45)  can  be  neglected 
and  the  following  approximations  can  be  made 


S  'V'  k 


(57) 


S 


0 


(58) 


The same approximations can be made in (47) and (48). Combining (42), (56) and
(58) gives the following approximation

$$M_{r0} \approx E_0 + E_2 x_e^2 + E_4 x_e^4 \qquad (59)$$

where

$$E_0 = \frac{L}{A}\, k_{00} \qquad (60)$$

$$E_2 = \frac{L}{A}\, 3 b \varepsilon_2 \qquad (61)$$

$$E_4 = \frac{L}{A}\, 5 e \varepsilon_4 \qquad (62)$$

The quantities $E_0$, $E_2$ and $E_4$ are soil parameters which are independent
of the size of the soil sample and machine characteristics. The calculation
of the resilient Poisson's ratio requires further study.

The  expression  for  given  by  (53)  -  (56)  characterizes  the  resilient 

modulus  in  terms  of  Oj^  ,  Og  and  oi  .  The  parameters  required  to  describe 

the  resilient  modulus  are  ,  b  ,  e  ,  6  ,  n  ,  «  %  .  “  .  L  ,  .  These 

parameters  will  depend  on  the  type  of  testing  machine,  size  of  soil  sample,  and 

the  type  of  soil  constituting  the  soil  sample;  and  therefore  the  parameters 

will  have  to  be  determined  for  each  type  of  testing  machine.  Typical  values  of 

the  parameters  describing  a  resilient  modulus  test  as  described  by  (39)  -  (62) 

are  given  for  lean  clay  in  Table  IV.  It  is  possible  to  determine  resilient 

modulus  parameters  which  are  independent  of  the  size  of  the  soil  sample  and 

independent of the type of testing apparatus. The parameters $E_0$, $E_2$ and $E_4$

that  occur  in  (60)  -  (62)  are  soil  parameters  and  are  independent  of  the  sample 
size  or  loading  conditions.  It  is  the  quantity  E^  that  must  be  compared  with 
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the  value  of  E  determined  from  the  vibratory  load-deflection  curves  that 

s 

were  measured  directly  on  the  subgrade.  The  value  of  Eg  was  determined  in 
a  manner  such  that  its  value  is  independent  of  the  static  and  dynamic  loads 
exerted  by  the  vibrator. 

The preceding analysis shows that the characteristic shape of the nonlinear
dynamic load-deflection curves measured in the field by the WES 16-kip
vibrator is due in part to the basic nonlinear response of the material in the
subgrade  to  dynamic  loads.  The  signs  of  the  coefficients  describing  the  resil¬ 
ient  modulus  test:  >  0  ,  >  0  >  <  P  »  '52  ^  0  >.  <  0  >  e  >  0  , 

determine  to  a  large  extent  the  signs  of  the  corresponding  coefficients  deter¬ 
mined  from  the  vibratory  nondestructive  tests  conducted  on  pavements  and  sub- 
grades.  However,  inertial,  damping  and  frequency  effects  will  affect  the 
values  of  and  determined  by  vibratory  nondestructive  test¬ 

ing.  For  the  vibratory  nondestructive  tests  done  on  pavements  and  subgrades  at 
15  Hz,  it  is  generally  found  that  >  0  and  >  0  which  is  in  agreement 
with  the  signs  of  the  corresponding  coefficients  describing  the  resilient 
modulus  laboratory  test.  For  frequencies  different  from  15  Hz  and  for  excep¬ 
tional  pavement  cases  it  is  found  that  >  0  and  <  0  or  <  0  and 
>  0  .  Therefore,  the  combination  of  the  large  effective  mass  associated 
with  a  pavement  and  subgrade,  and  the  relatively  high  frequency  of  operation 
of  the  WES  16-kip  vibrator  can  produce  a  dynamic  load-deflection  curve  which 
has  a  shape  which  is  considerably  different  from  the  shape  of  the  dynamic  load- 
deflection  curve  measured  in  the  laboratory  during  a  resilient  modulus  test. 

Because  of  the  finite  size  of  the  soil  sample  for  the  resilient  modulus 
test,  the  effective  mass  of  the  soil  sample  is,  to  a  good  approximation,  equal 
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to  the  actual  mass  of  the  sample.  The  effective  mass  that  enters  the  dynamical 
calculations  for  the  vibratory  nondestructive  field  tests  is  generally  quite 
large  compared  to  the  moving  mass  of  the  vibrator  because  of  the  large  inertial 
effects  associated  with  the  pavement  and  subgrade.  The  large  effective  mass 
and high frequency of the vibratory nondestructive field tests indicate that
the inertial and damping terms are comparable to or larger than the elastic effects,
$m\omega^2 \sim k$ and $C\omega \sim k$. The relatively small mass of the soil sample used for
the laboratory resilient modulus tests and the low frequency at which these
tests are conducted suggest that for this case $m\omega^2 \ll k$ and $C\omega \ll k$, and
the linear and nonlinear elastic properties are measured directly in this test.

The  resilient  modulus  tests  combined  with  the  nonlinear  dynamical  theory 
of  these  tests  indicate  that  the  static  nonlinear  elastic  coefficients  b  and 
e  have  the  signs  b  <  0  and  e  >  0  .  It  is  this  basic  property  of  soils  that 
is  responsible  for  making  the  corresponding  coefficients  determined  from  field 
tests  exhibit  the  same  signs.  It  is  the  nonzero  values  of  b  and  e  as  deter¬ 
mined  from  the  resilient  modulus  that  are  responsible  for  the  finite  depth  of 
influence  of  the  static  stress-strain  field  in  the  subgrade  beneath  a  static 
load  placed  on  the  pavement  surface.  The  intrinsic  nonlinearity  exhibited  by 
the  soil  during  the  resilient  modulus  tests  is  responsible  for  the  finite  depth 
of  influence  of  the  static  stress-strain  field  in  an  actual  soil  formation. 

V.  CONCLUSION 

The nonlinear dynamic pavement response model that is presented in this
paper gives a quantitative description of the dynamic response of a pavement
surface under the action of the dynamic and static load applied to the
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pavement  surface  by  the  WES  16-kip  vibrator.  The  model  parameters  -  spring 
constants, effective mass, damping constant and finite depth of influence of
the static load - have been determined as a function of pavement strength as
represented by the measured DSM. The nonlinear pavement response model gives
a theoretical expression for the pavement response in terms of these parameters
and in terms of the elastic constants of the pavement and subgrade. For a
suitable choice of the elastic moduli of the pavement layers, it is possible
to predict the value of the subgrade modulus from the dynamic load-deflection
curve measured at the pavement surface.

Of  much  importance  to  pavement  engineers  is  an  estimation  of  the  strength 
and  condition  of  a  subgrade  as  measured  by  its  subgrade  modulus.  The  nonlinear 
elastic  response  model  of  the  dynamic  load-deflection  curve  combined  with 
measured  values  of  this  curve  is  sufficient  to  determine  the  subgrade  elastic 
modulus  quickly  and  accurately.  This  work  was  funded  by  the  Federal 
Aviation  Administration. 
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TABLE  II 


INPUT OF WES NONLINEAR DYNAMIC PROGRAM

SITE B2A
DSM = 700 kips/in.

Dynamic load, kips     Dynamic amplitude A, in.

        0                      0.0
        2                      0.003
        4                      0.007
        6                      0.011
        8                      0.015
       10                      0.020
       12                      0.025
       14                      0.030
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TABLE  IV 


PARAMETERS  DESCRIBING  THE  DYNAMIC  CHARACTERISTICS 
OF  THE  RESILIENT  MODULUS  LABORATORY  TEST 


in^ 

6.16 

In 

6.0 

<S 

2 

lb 

5.0 

“l 

lb  sec^/in 

0.013 

“2 

sec~^ 

6.0 

lb /in 

0.468 

^2 

lb  sec/in 

30.0 

6 

Ib/in 

180.0 

n  ■ 

Ib/in 

1.5  X  lO** 

"2 

lb /In 

4.0  X  lO** 

Ib/in 

o 

f— 1 

X 

o 

• 

=0 

lb /in 3 

-2.0  X  10^ 

E 

2 

lb/ in^ 

3.6  X  10^^ 

E 

4 


Ib^/in** 

-5.66  X  10^3 

IbVin® 

1.45  X  10^7 

lb2/in‘^ 

1.89  X  10^3 

lb‘»/in6 

3.5  X  102® 

Ib^/in** 

-1.89  X  10^3 

Ib'^/in® 

7.2  X  102** 

dimensionless 

30.0 

dimensionless 

50.0 

dimensionless 

31.0 

dimensionless 

54.0 

lb/in2 

1.5  X  10** 

Ib/in** 

-2.0  X  10® 

Ib/in® 

9.7  X  10^3 

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Figiire  1.  Typical  frequency  response  curve 
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Figure  2.  Typical  dynamic  load-deflection  curve 


Figure 3. Single and double mass dynamic pavement response models.
Panels: single mass with (1) linear spring, (2) nonlinear spring; double
mass with linear springs.


Figure 4. Measured quantities obtained from frequency response curves:
$f_R$ = resonance frequency; $A_R$ = dynamic amplitude at resonance;
$J(f) = A_R/A$ = ratio of amplitude at resonance to amplitude at arbitrary
frequency.
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DYNAMIC  AMPLITUDE  A 


Figure 5. Method of calculating subgrade modulus from measured frequency
response curves (frequency response method; Baltimore (B2), T = 77°F)
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10"  LB/IN. 


Figure 6. Dynamic spring constant versus measured DSM

Figure 8. Damping constant versus measured values of DSM

Figure 9. Linear static spring constant versus DSM value
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DSM,  KIPS/IN. 

nonlinear parameter versus DSM value

Figure 11. Fifth order static nonlinear parameter versus DSM value

Figure 13. Dimensionless parameter 0 versus measured values of DSM
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Figure  15.  Critical  frequency  versus  measured  DSM 


Figure 16. Determination of subgrade modulus from measured
dynamic load-deflection curves (dynamic load-deflection method;
Baltimore (B2), T = 77°F). Axes: dynamic amplitude A versus dynamic load,
kips. Input data: (1) DSM value; (2) point by point tabulation of the
load-deflection curve. These are processed by the WES nonlinear dynamic
load-deflection program. Output data: k, M, C, $E_s$; for this example
$E_s = 22.8 \times 10^3$ psi.


CHARACTERIZATION  OF  BEHIND  ARMOR  EFFECTS  FOR 
LONG  ROD  PENETRATORS 


Victor  D.  Maki 
Engineering  Branch 
Ballistic  Modeling  Division 
US  Army  Ballistic  Research  Laboratory 
Aberdeen  Proving  Ground,  Maryland 


ABSTRACT. This study was needed to provide information on the
behind armor effects essential to armored vehicle analysis and to the
design of future kinetic energy penetrators. Both spall and rod penetrator
fragment data were examined for gross characteristic statistical
trends. Least squares was employed to ascertain causes for similarities
in the data base. A linear function relating fragment mass to
velocity was employed to study effects of variation in projectile materials,
initial projectile weights, striking velocities, length to diameter ratios
and plate thicknesses. Kolmogorov-Smirnov type test statistics were used
to determine whether or not a unique parent weight distribution existed
between various firings. The Weibull, Poisson and Truncated Normal cumulative
distribution functions were also compared with empirical weight
distributions for several selected firings. This paper summarizes the
characteristics found.


1. INTRODUCTION. Whenever armored vehicles of any kind are attacked
by metal rod penetrators, fragments are sprayed inside the vehicle which
damage components and personnel. To facilitate a greater understanding
of the mechanical processes involved, a gross characterization was done
that includes fragment numbers, mass distributions, and spatial locations
behind 6.35 and 12.7 millimeter rolled homogeneous steel targets. The
fragment data base used for this analysis comprises 140 test firings
completed at the BRL in 1970. In the data base, projectile weights, length
to diameter ratios, projectile material types, target plate thickness,
fragment masses, fragment locations (see Figure 1), and velocities were
found recorded in a BRL Memorandum Report¹. These data were transcribed
onto IBM punched cards for computer reduction and analysis. Initially,
the natural logs of fragment mass and velocity were fitted with a first
degree polynomial of the form $y = a_0 + a_1 x$, where $y$ denotes the ln of the
velocity parameter and $x$ the ln of fragment mass. This polynomial fitting and
plotting technique was later published by the author as a BRL Systems


¹L. Herr and C. Gabarek, "Ballistic Performance and Beyond Armor Data for
Rods Impacting Steel Armor Plates," US Army Ballistic Research Laboratories
Memorandum Report #2575, January 1976.
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Programming  Bulletin^.  The  volume  of  linear  equations  and  plots  produced 
was  found  to  be  valuable  as  a  convenient  index  in  a  search  for  trends  from 
firing  to  firing  which  later  led  to  a  zone  analysis  of  the  data.  A  zone 
definition  and  the  results  of  the  zone  analysis  are  on  the  following  page. 

Zone number 1 is represented by the innermost circle on the recovery
media surface and is measured by an angle of ten degrees with respect to
the shotline. Zones 2 through 5 are defined by an angle increase of ten
degrees per zone. For all firings the shotline was orthogonal to the
target surface plane. Projectile striking velocities were in the 900 to
1500 meters/sec range. The straight line function

$$\ln(\text{fragment velocity}) = a_0 + a_1 \ln(\text{fragment mass}),$$

when fitted on a zone by zone basis, revealed a distinct trend. As zone
angle increased, the slope values $a_1$ became more negative. This agrees
with the basic conservation of energy law of physics.
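
The zone by zone fit described above can be sketched in a few lines. This is only an illustration, not the BRL plotting subroutine: the function names `fit_log_log` and `fit_by_zone` are hypothetical, and the fragment records are synthetic values invented so that the slope is known in advance.

```python
import math

def fit_log_log(masses, velocities):
    """Least-squares fit of ln(v) = a0 + a1 * ln(m); returns (a0, a1)."""
    xs = [math.log(m) for m in masses]
    ys = [math.log(v) for v in velocities]
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    a1 = sxy / sxx
    a0 = y_mean - a1 * x_mean
    return a0, a1

def fit_by_zone(fragments):
    """fragments: iterable of (zone, mass, velocity); returns {zone: (a0, a1)}."""
    by_zone = {}
    for zone, mass, velocity in fragments:
        by_zone.setdefault(zone, []).append((mass, velocity))
    return {
        zone: fit_log_log([m for m, _ in rows], [v for _, v in rows])
        for zone, rows in sorted(by_zone.items())
    }

# Synthetic (made-up) fragments obeying v = 900 * m**slope exactly in each zone.
synthetic = []
for zone, slope in [(1, -0.10), (2, -0.20), (3, -0.30)]:
    for mass in [1.0, 5.0, 20.0, 80.0]:
        synthetic.append((zone, mass, 900.0 * mass ** slope))

fits = fit_by_zone(synthetic)
for zone, (a0, a1) in fits.items():
    print(f"zone {zone}: a0 = {a0:.3f}, a1 = {a1:.3f}")
```

With real firing data the slopes would of course scatter; the point of the sketch is only the per-zone grouping followed by an ordinary least-squares line in log-log coordinates.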

2. The Weibull Distribution Function. In a testing of the Poisson,
Truncated Normal, and Weibull distribution functions, the latter provided
the best fit to the fragment mass parameter. A detailed report of how
the Weibull distribution function parameters were estimated can be found
in Reference 3. A two-sided Kolmogorov-Smirnov type test was employed as
the criterion for best fit. The empirical cumulative distribution function
(Equation 1) was computed for the fragment mass parameters for several
selected firings. A graphical and numerical comparison with the Weibull
cumulative distribution function (Equation 2) was then performed.


$$F_N(x) = \begin{cases} 0, & x < X_{(1)} \\ K/N, & X_{(K)} \le x < X_{(K+1)} \\ 1, & x \ge X_{(N)} \end{cases} \qquad (1)$$


$$F(x) = 1.0 - \exp[\,\cdots\,] \qquad (2)$$

where $x$ is the fragment mass parameter.


²Victor D. Maki, "PDT Plot Subroutine with Bi-variate Analysis," US Army
Ballistic Research Laboratories Systems Programming Bulletin #SPB-G-74,
17 July 1974.

³Victor D. Maki, "Three Probability Density Function FORTRAN Subroutines,"
US Army Ballistic Research Laboratories Interim Memorandum Report #396,
June 1975.
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The  computed  maximum  absolute  difference  was  numerically  compared  with, 

$1.36/\sqrt{N}$, which is fully described in Reference 4. If the computed maximum
absolute difference was found to be less than the above statistic, a
decision was made to accept the Weibull distribution function for
fragment mass. Included in this paper is a plot of this type of test for
round number 5 (see Figure 2). For "good" fits of the Weibull distribution
function to a large number of firings, fragment masses greater than
100 grains should be ignored. Because rod penetrator fragment mass
distributions are characteristically bi-modal, most of the fragments
can be found in the first mode and therefore, for this data set,
ignoring the second mode caused no significant loss in accuracy.
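
The acceptance rule above can be illustrated with a short sketch. Several things here are assumptions rather than the paper's method: the two-parameter Weibull form $F(x) = 1 - \exp[-(x/\text{scale})^{\text{shape}}]$ is assumed because the paper's own parameterization is not legible, the shape and scale values are hypothetical, and the sample is synthetic. The sample size N = 190 is chosen because $1.36/\sqrt{190}$ reproduces the value .09866477 quoted with Figure 2.

```python
import math

def weibull_cdf(x, shape, scale):
    """Assumed two-parameter Weibull CDF: 1 - exp(-(x/scale)**shape)."""
    if x <= 0.0:
        return 0.0
    return 1.0 - math.exp(-((x / scale) ** shape))

def ks_max_difference(sample, cdf):
    """Two-sided Kolmogorov-Smirnov statistic sup |F_N(x) - F(x)|."""
    xs = sorted(sample)
    n = len(xs)
    largest = 0.0
    for k, x in enumerate(xs, start=1):
        f = cdf(x)
        # The empirical CDF jumps from (k-1)/N to k/N at the k-th ordered value.
        largest = max(largest, abs(k / n - f), abs((k - 1) / n - f))
    return largest

def accept_weibull(sample, shape, scale):
    """Return (max difference, critical value 1.36/sqrt(N), accept flag)."""
    n = len(sample)
    d = ks_max_difference(sample, lambda x: weibull_cdf(x, shape, scale))
    critical = 1.36 / math.sqrt(n)
    return d, critical, d < critical

# Synthetic fragment masses placed at the Weibull quantiles (k - 0.5)/N.
n = 190
shape, scale = 1.5, 20.0  # hypothetical parameter values
sample = [
    scale * (-math.log(1.0 - (k - 0.5) / n)) ** (1.0 / shape)
    for k in range(1, n + 1)
]

d, critical, accepted = accept_weibull(sample, shape, scale)
print(f"max difference = {d:.6f}, critical value = {critical:.8f}, accepted = {accepted}")
```

The constant 1.36 is the usual large-sample 5 percent point of the two-sided Kolmogorov-Smirnov statistic, so the comparison amounts to a test at roughly the 5 percent level.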


3.  ZONE  DEFINITION. 


Figure  1 

Zone 1 is represented by the innermost circle and forms an angle of
ten degrees measured from the shotline. Zones 2 through 5 are defined by
an angle increase of ten degrees per zone.

⁴W. J. Conover, Practical Non-Parametric Statistics, John Wiley,
New York, NY, 1971.
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1.36/>^  =  .09866477 


Figure 2

The maximum absolute difference is smaller than the above value, so
the Weibull distribution is accepted for this firing.

The fitting of the Weibull distribution to spall fragment mass is documented
in Reference 5. Further inquiries on this
topic should be directed to Mr. John Misey, US Army Ballistic Research
Laboratory, Aberdeen Proving Ground, Maryland 21005.


5.  CONCLUSIONS  OF  ANALYSIS. 

a.  As  zone  angle  increases,  average  fragment  velocity  decreases, 
numbers  of  fragments  decrease  and  average  mass  increases. 

b. A two-sided Kolmogorov-Smirnov type test shows the Weibull distribution
function is a good choice for describing fragment masses less than
100 grains.

c.  Rod  penetrator  fragment  mass  distributions  are  characteristically 
bi-modal. 


⁵"Behind Armor Data for Long Rod Penetrators," paper presented by
Mr. John Misey at the Second Annual Automatic Cannon Caliber Munitions
Symposium, 25 September 1975, at Frankford Arsenal.
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MATHEMATICAL  MODELS  OF  SYSTEMS  AND  TACTICS  IN  LAND  COMBAT 


Roger  F.  Willis 

US  Army  TRADOC  Systems  Analysis  Activity 
White  Sands  Missile  Range,  New  Mexico  88002 


ABSTRACT. This paper covers a variety of mathematical models that have
recently been developed, tailored to specific decision problems in tactics
and  alternative  system  tradeoffs.  These  models  emphasize  rapid  and  flexible 
variation  of  assumptions,  investigation  of  alternative  tactics,  tradeoffs 
between  system  parameters,  tradeoffs  between  the  elements  of 
various  optimizations.  Alternative  mathematical  formulations  include  linear 
versus  non-linear,  constant  versus  time-varying  coefficients  and  stochastic 
versus  deterministic. 


1.  INTRODUCTION 

Flexible and efficient mathematical models are required for use in different
phases of a particular force evaluation or combat developments study. In an
early phase these models can be used to compare and screen alternatives --
alternative systems, alternative tactics or alternative mixes. In late phases
the same models can be used for sensitivity analysis, to give approximate
answers to "what if" questions -- i.e., to determine how study results might
change if certain assumptions are varied. In most studies the major analytical
tool will be a large, relatively slow and expensive computer model or
simulation or computer-assisted wargame (e.g., DIVWAG). The mathematical
models presented in this paper are intended to supplement the large models,
to provide additional insights and to enrich the study results. These models
can also be used to develop hypotheses (e.g., about the relative merits of
alternative tactics) that can then be tested with high resolution stochastic
simulations.

2. We consider the class of models consisting of sets of ordinary differential
equations in which each equation represents the time rate of change of the
number remaining of a particular type of weapon. The equations can be
deterministic or stochastic, linear or non-linear, with constant coefficients
or variable coefficients. We consider ten specific decision problems and
the mathematical categories of models as follows:


a.  deterministic,  linear,  constant  coefficients 

(1)  tradeoff  between  ground  forces  and  aircraft 

(2)  remotely  piloted  vehicles 

(3)  air  defense  suppression 
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(4)  optimum  artillery  mix 

b.  deterministic,  linear,  variable  coefficients 

(1)  antiarmor  target  priorities 

(2)  optimum  disengagement  time 

(3)  electronic  warfare 

c.  deterministic,  non-linear 

(1)  weapon  effectiveness 

(2)  force  required 

d.  stochastic 


(1)  time  to  achieve  goal 

3.  More  complete  statements  of  the  decision  problems  are: 

a.  To  what  extent  can  tanks  be  traded  off  for  close  support  aircraft? 

nnneuv.r  ‘o  “"p 


and  counterbattery  fire 

ing  a'^ecK^nlld  Wan?Ji”™v1s?onJ  ^'‘<‘0^^^- 

e. How should Red tank fire be allocated between three or four types of
anti-armor weapons?

f.  What  is  the  optimum  time  for  a  defending  anti-armor  force  to  disengage? 
attack;-ng''Jr*J«iI;g  ESr  surveillance  systenis  and 

h.  What  weapon  effectiveness  is  required  against  a  given  enemy  force: 

(1)  if  replacements  are  available? 

(2)  if  no  replacements  are  available? 
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i.  What  force  size  is  required  against  a  given  enemy? 

j.  With  a  given  force  available,  how  much  time  would  be  required  to 
reduce  an  enemy  force  to  a  specified  level? 


4. In this paper we will present models for only four of these
problems: a, e, i and j. These particular examples were selected to illustrate
the four categories of models and several different measures of effectiveness.

In the first problem, involving tradeoffs between ground and air, we are
interested in the broader question: what mix of ground and air forces
do we need in NATO? What factors should be incorporated in a simple model
designed to give gross, order-of-magnitude answers to this question? Some
of them are: aircraft availability rate and sortie rate, attrition of
aircraft, allocation of aircraft against alternative target types (tanks
or artillery), lethality of air-delivered weapons, tank effectiveness, tank
vulnerability and artillery effectiveness. The model is presented in Figure 1,
with variables and factors defined as follows:


$X_1(t)$ = Red tanks
$X_2(t)$ = Red artillery
$Y_1(t)$ = Blue tanks
$Y_2(t)$ = Blue artillery
$Y_3(t)$ = Blue aircraft

J  =  rate  at  which  a  Blue  tank  can  kill  Red  tanks 

K  =  rate  at  which  a  Red  tank  can  kill  Blue  tanks 

P  =  Blue  aircraft  attrition  rate  per  sortie  flown 
b  =  rate  at  which  a  Blue  artillery  weapon  can  kill  Red  tanks 
k  =  average  number  of  Red  tanks  killed  per  aircraft  sortie 
s  =  sortie  rate,  per  available  aircraft 

V  =  aircraft  availability  rate,  taking  into  account  NORM,  NORS,  etc. 
r  =  replacement  rate  for  Red  tanks 

L  =  rate  at  which  a  Blue  artillery  weapon  can  kill  Red  artillery 

M  =  rate  at  which  a  Red  artillery  weapon  can  kill  Blue  artillery 

N  =  average  number  of  Red  artillery  weapons  killed  per  aircraft  sortie 
f  =  fraction  of  Blue  aircraft  sorties  employed  against  Red  tanks  (the  rest 
are  used  against  artillery) 

g  =  fraction  of  Blue  artillery  employed  against  Red  tanks  (the  rest  are  used 
against  artillery) 
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RED  TANKS 

$$\frac{dX_1}{dt} = -J Y_1 - b g Y_2 - V s k f Y_3 + r$$

RED ARTILLERY

$$\frac{dX_2}{dt} = -L (1 - g) Y_2 - V s N (1 - f) Y_3$$

BLUE TANKS

$$\frac{dY_1}{dt} = -K X_1$$

BLUE ARTILLERY

$$\frac{dY_2}{dt} = -M X_2$$

BLUE AIRCRAFT

$$\frac{dY_3}{dt} = -V s P Y_3$$

Figure 1

Rates at which Committed Strengths Change
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From  the  first  differential  equation  in  Figure  1  we  see  that  Red  tanks 
are killed by Blue tanks ($Y_1$), Blue artillery ($Y_2$) and Blue aircraft ($Y_3$).
To some extent Red tank losses are compensated for by replacement tanks,
at a rate of r per minute. The Blue commander has two weapon allocation
problems: allocation of available aircraft against tanks (f) and against
artillery (1-f); allocation of available artillery against tanks (g) and
against artillery (1-g).
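
A minimal numerical sketch of the five rate equations of Figure 1 follows. It assumes the reading $dX_1/dt = -JY_1 - bgY_2 - VskfY_3 + r$, $dX_2/dt = -L(1-g)Y_2 - VsN(1-f)Y_3$, $dY_1/dt = -KX_1$, $dY_2/dt = -MX_2$, $dY_3/dt = -VsPY_3$. The values of J, K, V, s, P and k are those quoted in para 6; every other coefficient and all initial strengths are hypothetical, chosen only so the example runs. A classical fourth-order Runge-Kutta step is used for convenience; the paper does not say how its solutions were obtained.

```python
import math

def derivatives(state, p):
    """Right-hand sides of the five Figure 1 equations (time in minutes)."""
    x1, x2, y1, y2, y3 = state
    sorties = p["V"] * p["s"] * y3  # aircraft sorties flown per minute
    dx1 = -p["J"] * y1 - p["b"] * p["g"] * y2 - sorties * p["k"] * p["f"] + p["r"]
    dx2 = -p["L"] * (1 - p["g"]) * y2 - sorties * p["N"] * (1 - p["f"])
    dy1 = -p["K"] * x1
    dy2 = -p["M"] * x2
    dy3 = -sorties * p["P"]
    return (dx1, dx2, dy1, dy2, dy3)

def rk4_step(state, dt, p):
    """One classical Runge-Kutta step of length dt."""
    def shifted(base, slope, h):
        return tuple(b + h * s for b, s in zip(base, slope))
    k1 = derivatives(state, p)
    k2 = derivatives(shifted(state, k1, dt / 2), p)
    k3 = derivatives(shifted(state, k2, dt / 2), p)
    k4 = derivatives(shifted(state, k3, dt), p)
    return tuple(
        s + dt / 6 * (a + 2 * b + 2 * c + d)
        for s, a, b, c, d in zip(state, k1, k2, k3, k4)
    )

def simulate(initial, p, minutes, dt=1.0):
    """Integrate from t = 0 to t = minutes and return the final strengths."""
    state = tuple(initial)
    for _ in range(int(round(minutes / dt))):
        state = rk4_step(state, dt, p)
    return state

params = {
    "J": 0.003, "K": 0.001, "V": 0.70, "s": 0.004, "P": 0.05, "k": 2.0,  # para 6
    "b": 0.001, "L": 0.001, "M": 0.001, "N": 1.0,  # hypothetical
    "f": 0.5, "g": 0.5, "r": 0.0,                  # hypothetical
}
# Hypothetical initial strengths: (X1, X2, Y1, Y2, Y3).
initial = (500.0, 100.0, 200.0, 100.0, 100.0)
final = simulate(initial, params, minutes=120.0)

names = ("Red tanks", "Red artillery", "Blue tanks", "Blue artillery", "Blue aircraft")
for name, value in zip(names, final):
    print(f"{name:15s} {value:8.1f}")

# The aircraft equation decouples, so it has the closed form Y3(0) * exp(-V s P t).
expected_aircraft = initial[4] * math.exp(-params["V"] * params["s"] * params["P"] * 120.0)
print(f"analytic Blue aircraft: {expected_aircraft:.1f}")
```

Because the equations are linear with constant coefficients, nothing in them stops a strength from going negative over long horizons; a production model would need to stop or clamp at zero.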

5. The solutions of this model express the numbers of weapons of each type
surviving (and committed) as functions of time. This model can be used
to investigate tradeoffs between tanks and close air support aircraft in
the following way. We set a tactical goal and calculate the various
combinations of "number of tanks" and "number of aircraft", each of which
will achieve the goal. An example of a goal is: "Reduce the
Red tank strength by 100 within 2 hours." The tradeoff curves (tanks versus
aircraft) will usually depend on the values assumed by all the other factors
in the model, such as Red tank effectiveness, Blue tank effectiveness, Blue
aircraft attrition rate, number of Blue artillery tubes available, etc.
The tradeoff curves also vary with the type of goal required. "Reduce the
Red to Blue tank force ratio by 50% in 6 hours" would give different curves.
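
The procedure in para 5 (fix a goal, then find the combinations of tanks and aircraft that meet it) can be sketched as a search over the initial number of Blue tanks for each aircraft level. The sketch assumes the simplified model with artillery and replacements left out, uses the coefficient values quoted in para 6, and takes a hypothetical initial Red strength of 600 tanks. The helper names `red_tanks_killed` and `min_blue_tanks`, the Euler step and the bisection search are illustrative choices, not the author's method, and the output is not expected to reproduce the paper's table.

```python
def red_tanks_killed(blue_tanks, aircraft, red_tanks, minutes,
                     J=0.003, K=0.001, V=0.70, s=0.004, P=0.05, k=2.0, dt=0.1):
    """Cumulative Red tank losses with artillery and replacements omitted."""
    x, y1, y3 = float(red_tanks), float(blue_tanks), float(aircraft)
    killed = 0.0
    for _ in range(int(round(minutes / dt))):
        kills = min(x, (J * y1 + V * s * k * y3) * dt)  # Red tanks lost this step
        losses = min(y1, K * x * dt)                    # Blue tanks lost this step
        x -= kills
        y1 -= losses
        y3 -= V * s * P * y3 * dt                       # aircraft attrition
        killed += kills
    return killed

def min_blue_tanks(goal, aircraft, red_tanks, minutes, upper=10000.0, tol=0.01):
    """Smallest initial Blue tank strength that meets the kill goal (bisection)."""
    if red_tanks_killed(0.0, aircraft, red_tanks, minutes) >= goal:
        return 0.0
    if red_tanks_killed(upper, aircraft, red_tanks, minutes) < goal:
        return None  # goal unreachable within the search range
    lo, hi = 0.0, upper
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if red_tanks_killed(mid, aircraft, red_tanks, minutes) >= goal:
            hi = mid
        else:
            lo = mid
    return hi

# Goal from para 5: reduce the Red tank strength by 100 within 2 hours.
tradeoff = {}
for aircraft in (0, 50, 100):
    tradeoff[aircraft] = min_blue_tanks(goal=100.0, aircraft=aircraft,
                                        red_tanks=600.0, minutes=120.0)
    print(f"{aircraft:4d} aircraft -> about {tradeoff[aircraft]:.0f} Blue tanks")
```

Bisection is valid here because, in this model, adding Blue tanks can only increase the number of Red tanks killed in the allotted time.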

6.  For  example,  if  we  leave  out  Red  and  Blue  artillery  to  simplify  the 
calculations  and  make  the  following  assumptions 


J  =  .003 


K  =  .001 

V  =  0.70 


S  =  .004 
P  =  .05 


k  =  2 


we  get  the  following  results,  for  the  Blue  goal  of  killing  the  required 
number  of  Red  tanks  within  16-2/3  hours: 


Number of Red          Number of Blue
tanks killed         aircraft     tanks

    400                100         133
                        75         200
                        50         267

    300                100          33
                        75         100
                        50         167

7. The tradeoff between Blue tanks and Blue aircraft depends on two major
uncertainties: the duration of combat and the ratio of Blue tank effectiveness
(J) to Red tank effectiveness (K). We see this directly in the following
results, based on the assumptions: V = 0.70, s = 0.004, k = 2, P = 0.05.
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Ratio  of  Blue  tank 
effectiveness  to  Red 
tank  effectiveness 

1  to  1 


Combat  Time 
(minutes) 


Number  of  Blue  tanks 
equivalent  to  one 
Blue  aircraft 


3  to  1 


8. For the next decision problem (3e) the question is: How should Red
tank fire be allocated between three or four types of Blue anti-armor
weapons (targets)? The Red side makes tactical judgments about allocation
of fire. Here we let $f_1$ be the fraction of Red fire directed against type 1
Blue weapons, $f_2$ the fraction of Red fire directed against type 2 Blue
weapons, and so on. The $f_i$ could vary with time during the battle,
but in the examples given here we assume that for a given battle each $f_i$
is given a fixed value, with the sum of the $f_i$ equal to one.

9. The ability of individual weapons to kill targets (detect, hit, kill)
is assumed to vary with time during the battle, as intervisibility,
detection and weapon accuracy change. In the dynamics of combat, as the
battle progresses the rates at which weapons are lost change as the numbers
surviving change due to attrition. The model, a set of N + 1 differential
equations, is given below in para 10. The factors and variables are defined
as follows:

$X$ = number of Red tanks

$Y_1$ = number of type 1 Blue weapons (e.g., M60A1E3)

$Y_2$ = number of type 2 Blue weapons (e.g., TOW on M113)

$Y_3$ = number of type 3 Blue weapons (e.g., TOW on jeep)

$Y_4$ = number of type 4 Blue weapons (e.g., DRAGON)

$Y_N$ = number of type N Blue weapons

$f_i$ = fraction of Red tank fire allocated against type i Blue weapons
(This could include target opportunities as well as target priorities.)

$K_i(t)$ = average rate at which a type i Blue weapon can kill Red tanks

P™babnny,  rate  of  fire  and  kill 

$J_i(t)$ = average rate at which a Red tank can kill type i Blue weapons

We assume that each $K_i$ and $J_i$ is a linear function of time. In particular,

$$K_i(t) = a_i + b_i t$$

$$J_i(t) = c_i + d_i t$$


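10. The model is:

Red tanks

$$\frac{dX}{dt} = -\sum_{i=1}^{N} K_i(t)\, Y_i$$

Blue weapons

$$\frac{dY_i}{dt} = -f_i\, J_i(t)\, X, \qquad i = 1, \ldots, N$$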


11.  We  consider  five  alternative  tactical  allocation  schemes  for  the  Red 
tanks,  as  follows: 


a. Initial Blue strength

$$f_i = \frac{Y_i(0)}{\sum_{j=1}^{N} Y_j(0)}$$

b. Equal priorities by target type

$$f_i = \frac{1}{N}$$

c. Initial threat to Red tanks

$$f_i = \frac{a_i}{\sum_{j=1}^{N} a_j}$$

d. Initial ease of killing

$$f_i = \frac{c_i}{\sum_{j=1}^{N} c_j}$$

e. Later threat to Red tanks (time t, e.g., t = 5 or 10)

$$f_i = \frac{a_i + b_i t}{\sum_{j=1}^{N} (a_j + b_j t)}$$


12. To evaluate these Red alternatives we must make assumptions
about the initial force sizes and weapon kill capabilities. In a
number of runs we used the following values:

                               Blue versus Red     Red versus Blue
Blue weapons                    a_i      b_i        c_i      d_i

type 1 (tanks)                  .152     .163       .013     .015
type 2 (long range ATGM)        .053     .270       .008     .040
type 3 (short range ATGM)       .000     .600       .000     .060

With overall Red to Blue initial force ratios on the order of 3 to 1 or
4 to 1, the order of preference for the Red alternatives in para 11 turned
out as follows:

Best:   e. later threat to Red tanks
        a. initial Blue strength
        b. equal priorities by type
        d. initial ease of killing
Worst:  c. initial threat to Red tanks
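
The five allocation rules of para 11 and the time-varying model of para 10 can be sketched together as follows. Several things are assumptions: the model is taken to be $dX/dt = -\sum_i K_i(t) Y_i$ and $dY_i/dt = -f_i J_i(t) X$ with $K_i = a_i + b_i t$ and $J_i = c_i + d_i t$; the coefficient table is read row by row (one row per Blue weapon type); and the initial strengths (100 Red tanks, 10 Blue weapons of each type) are hypothetical, since the values used in the paper are not legible. The sketch makes no attempt to reproduce the ranking reported above.

```python
def allocation_schemes(blue0, a, b, c, t_later):
    """Fractions f_i of Red fire under the five rules a-e of para 11."""
    def normalize(weights):
        total = sum(weights)
        return [w / total for w in weights]
    n = len(blue0)
    return {
        "a": normalize(blue0),                     # initial Blue strength
        "b": [1.0 / n] * n,                        # equal priorities
        "c": normalize(a),                         # initial threat to Red tanks
        "d": normalize(c),                         # initial ease of killing
        "e": normalize([ai + bi * t_later for ai, bi in zip(a, b)]),  # later threat
    }

def simulate(red0, blue0, f, a, b, c, d, minutes, dt=0.01):
    """Euler integration of the assumed model; strengths are clamped at zero."""
    x, y, t = float(red0), [float(v) for v in blue0], 0.0
    for _ in range(int(round(minutes / dt))):
        red_losses = sum((ai + bi * t) * yi for ai, bi, yi in zip(a, b, y)) * dt
        blue_losses = [fi * (ci + di * t) * x * dt for fi, ci, di in zip(f, c, d)]
        x = max(0.0, x - red_losses)
        y = [max(0.0, yi - li) for yi, li in zip(y, blue_losses)]
        t += dt
    return x, y

# Coefficients as read (row-wise) from the table of para 12.
a = [0.152, 0.053, 0.000]
b = [0.163, 0.270, 0.600]
c = [0.013, 0.008, 0.000]
d = [0.015, 0.040, 0.060]
blue0 = [10.0, 10.0, 10.0]  # hypothetical initial Blue strengths by type
red0 = 100.0                # hypothetical initial Red tank strength

schemes = allocation_schemes(blue0, a, b, c, t_later=5.0)
results = {}
for label, f in schemes.items():
    results[label] = simulate(red0, blue0, f, a, b, c, d, minutes=3.0)
    red_left, blue_left = results[label]
    fractions = ", ".join(f"{fi:.3f}" for fi in f)
    print(f"scheme {label}: f = [{fractions}]  Red left = {red_left:.1f}  "
          f"Blue left = {sum(blue_left):.1f}")
```

Note that rule c puts no fire at all on the type 3 weapons, because their initial kill rate $a_3$ is zero even though it grows fastest with time; this is one plausible reason a rule based only on the initial threat could rank poorly.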


13.  In  the  next  decision  problem  (3i)  we  consider  the  tactical  question 
of  how  a  given  initial  Blue  force  should  be  broken  up  into  smaller  units 
for  employment  against  the  enemy.  If  the  effectiveness  of  the  defending 
Blue  force  does  not  depend  on  the  absolute  scale  of  the  battles  fought 

(but only on the force ratio) then it might not matter how
the initial force is broken up into units. We have investigated many
models; results for four of them follow.


Model A

    \frac{dR}{dt} = -K B(t)
    \frac{dB}{dt} = -J R(t)

Model B

    \frac{dR}{dt} = -K_1 B(t)^M
    \frac{dB}{dt} = -J_1 R(t)^N

Model C

    \frac{dR}{dt} = -K B(t) R(t) + a R(t)
    \frac{dB}{dt} = -J R(t) B(t) + b B(t)

Model D

    \frac{dR}{dt} = -(p + qt) B(t)
    \frac{dB}{dt} = -(a + bt) R(t)

14.  In evaluating combat of maneuver units the single most meaningful measure of effectiveness is the cumulative loss ratio, i.e., the ratio of total Red losses to total Blue losses. This loss ratio will depend on many factors, including (in most cases) the initial force ratio, i.e., the ratio of the initial number of Red weapons to the initial number of Blue weapons. If we calculate the cumulative loss ratio at the particular time t at which Blue has a fraction "A" of his force surviving (e.g., A = 0.70) the result for Model A is:

    L = \frac{F_o}{1 - A} \left[ 1 - \sqrt{1 - \frac{\sigma (1 - A^2)}{F_o^2}} \right]

where F_o is the initial force ratio (R_o/B_o) and \sigma = K/J, the ratio of individual weapon effectiveness coefficients. It is clear that, for Model A, L does not depend on the scale of the battle (B_o or R_o) but only on the initial ratio of forces.
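
The scale independence claimed for Model A can be checked numerically. The sketch below assumes Model A is the square-law pair dR/dt = -K B, dB/dt = -J R, integrates it with a classical Runge-Kutta step until Blue falls to the fraction A of its initial strength, and compares the result with the closed form implied by the conserved quantity R^2 - \sigma B^2 (with \sigma = K/J). The coefficient values and force sizes are hypothetical.

```python
import math

def loss_ratio_model_a(r0, b0, k, j, frac, dt=1e-4):
    """Integrate dR/dt = -k*B, dB/dt = -j*R (RK4) until B <= frac*b0."""
    def deriv(r, b):
        return -k * b, -j * r
    r, b = r0, b0
    while b > frac * b0:
        k1r, k1b = deriv(r, b)
        k2r, k2b = deriv(r + 0.5 * dt * k1r, b + 0.5 * dt * k1b)
        k3r, k3b = deriv(r + 0.5 * dt * k2r, b + 0.5 * dt * k2b)
        k4r, k4b = deriv(r + dt * k3r, b + dt * k3b)
        r += dt * (k1r + 2 * k2r + 2 * k3r + k4r) / 6
        b += dt * (k1b + 2 * k2b + 2 * k3b + k4b) / 6
    return (r0 - r) / (b0 - b)  # cumulative Red losses / Blue losses

def closed_form(f0, sigma, frac):
    """Loss ratio from R^2 - sigma*B^2 = const, evaluated at B = frac*B0."""
    return f0 * (1 - math.sqrt(1 - sigma * (1 - frac**2) / f0**2)) / (1 - frac)

K, J, A = 0.10, 0.05, 0.70          # hypothetical coefficients, 70% Blue survival
small = loss_ratio_model_a(30.0, 10.0, K, J, A)
large = loss_ratio_model_a(300.0, 100.0, K, J, A)   # same force ratio, 10x scale
exact = closed_form(3.0, K / J, A)
print(small, large, exact)
```

Both battles have F_o = 3 and give the same loss ratio to within the integration step, which is the sense in which Model A is independent of scale.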


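15.  For Model B, the loss ratio is:

    L = \frac{F_o}{1 - A} \left[ 1 - \left( 1 - \frac{N+1}{M+1} \cdot \frac{\sigma (1 - A^{M+1}) R_o^{M-N}}{F_o^{M+1}} \right)^{1/(N+1)} \right]

If M does not equal N then L does depend on the scale (R_o) but if M = N
then it does not.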

16.  The cumulative loss ratio L satisfies the following equation when
B(t) equals A B_o for Model C:

    b \log [F_o - (1 - A) L] + J (1 - A) B_o L = b \log F_o + a \log A + K (1 - A) B_o

Since B_o appears explicitly the loss ratio does depend on scale for Model C.

17.  Based on the Taylor series solutions of Model D, the cumulative loss
ratio, at any time t, is:

    L(t) = \frac{p t - \frac{t^2}{2} (p a F_o - q) + \frac{t^3}{6} [p^2 a - (p b + 2 q a) F_o] - \cdots}
                {a F_o t - \frac{t^2}{2} (a p - b F_o) + \frac{t^3}{6} [a^2 p F_o - (a q + 2 b p)] - \cdots}

This expression depends on F_o but not on B_o or R_o (and hence not on the
scale of battle).
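
As a check on the series for Model D, the following sketch compares the three-term Taylor expression with a direct numerical integration. It assumes Model D has the form dR/dt = -(p + qt) B, dB/dt = -(a + bt) R; the coefficient values and force sizes are hypothetical and serve only to exercise the formula.

```python
def series_loss_ratio(t, f0, a, b, p, q):
    """Three-term Taylor approximation of the cumulative loss ratio L(t)."""
    num = p * t - t**2 / 2 * (p * a * f0 - q) \
        + t**3 / 6 * (p * p * a - (p * b + 2 * q * a) * f0)
    den = a * f0 * t - t**2 / 2 * (a * p - b * f0) \
        + t**3 / 6 * (a * a * p * f0 - (a * q + 2 * b * p))
    return num / den

def integrated_loss_ratio(t_end, r0, b0, a, b, p, q, steps=2000):
    """RK4 integration of dR/dt = -(p+q*t)*B, dB/dt = -(a+b*t)*R."""
    def deriv(t, red, blue):
        return -(p + q * t) * blue, -(a + b * t) * red
    h = t_end / steps
    t, red, blue = 0.0, r0, b0
    for _ in range(steps):
        k1 = deriv(t, red, blue)
        k2 = deriv(t + h / 2, red + h / 2 * k1[0], blue + h / 2 * k1[1])
        k3 = deriv(t + h / 2, red + h / 2 * k2[0], blue + h / 2 * k2[1])
        k4 = deriv(t + h, red + h * k3[0], blue + h * k3[1])
        red += h / 6 * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0])
        blue += h / 6 * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1])
        t += h
    return (r0 - red) / (b0 - blue)

coef = dict(a=0.05, b=0.01, p=0.10, q=0.02)   # hypothetical values
series = series_loss_ratio(0.5, 3.0, **coef)
small = integrated_loss_ratio(0.5, 30.0, 10.0, **coef)
large = integrated_loss_ratio(0.5, 300.0, 100.0, **coef)
print(series, small, large)
```

The two integrations share F_o = 3 at different scales and agree with each other and, for small t, with the truncated series.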


18.  The final decision problem to be illustrated in this paper is 3j:
with a given force available, how much time would be required to reduce
the enemy force to a specified level? For example, Red has six tank platoons
and Blue has three tank platoons. How long would it take Red to reduce
Blue to 65% of his initial strength (e.g., with about 2 platoons left)?

These numbers are too small for stable results from a deterministic model.
Thus, we consider the following stochastic model, developed by Isbell and
Marlow. At time t, the probability that exactly R Red units and exactly B
Blue units are surviving is:

    P(R, B; R_o, B_o, t)

where R_o and B_o are the initial strengths. If f and g are transition
probabilities, in small increments of time, for Red and Blue respectively,
then it is assumed that the function P satisfies the following differential
equations:

    \frac{dP(R,B)}{dt} = f(R+1, B) P(R+1, B) + g(R, B+1) P(R, B+1) - [f(R, B) + g(R, B)] P(R, B)

where P(R, B, t) = 0 if R > R_o or B > B_o
and P(R_o, B_o, 0) = 1.
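
A small numerical sketch of these state equations follows. It is not the closed-form solution of the paper: it assumes, for illustration, transition rates f(R, B) = kB (loss of a Red unit) and g(R, B) = kR (loss of a Blue unit) with a hypothetical common rate k, and it advances the probabilities with a simple forward Euler step.

```python
def rates(r, b, k):
    """Return (Red-loss rate f, Blue-loss rate g) in state (r, b); assumed linear."""
    f = k * b if r > 0 else 0.0   # a Red unit is lost at a rate proportional to B
    g = k * r if b > 0 else 0.0   # a Blue unit is lost at a rate proportional to R
    return f, g

def state_probabilities(r0, b0, k, t_end, dt=1e-3):
    """Euler-integrate dP/dt for all states 0..r0 by 0..b0, starting at (r0, b0)."""
    p = {(r, b): 0.0 for r in range(r0 + 1) for b in range(b0 + 1)}
    p[(r0, b0)] = 1.0
    for _ in range(int(round(t_end / dt))):
        new = {}
        for (r, b), prob in p.items():
            f, g = rates(r, b, k)
            inflow = 0.0
            if r + 1 <= r0:   # arrival from (r+1, b) by a Red loss
                inflow += rates(r + 1, b, k)[0] * p[(r + 1, b)]
            if b + 1 <= b0:   # arrival from (r, b+1) by a Blue loss
                inflow += rates(r, b + 1, k)[1] * p[(r, b + 1)]
            new[(r, b)] = prob + dt * (inflow - (f + g) * prob)
        p = new
    return p

def prob_blue_at_most(p, level):
    """Probability that no more than `level` Blue units survive."""
    return sum(v for (r, b), v in p.items() if b <= level)

K_RATE = 0.05   # hypothetical kill rate per unit per minute
p1 = state_probabilities(6, 3, K_RATE, 1.0)
p7 = state_probabilities(6, 3, K_RATE, 7.0)
print(prob_blue_at_most(p1, 1), prob_blue_at_most(p7, 1))
```

Because Blue strength can only decrease, the probability that Blue has been reduced to a given level by time t is the sum of P over the states at or below that level, which is the quantity tabulated against "t minutes or less".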


If it is assumed that f and g are linear functions of weapon characteristics
and that Red and Blue weapons are equally capable, then the solutions are
as given in Figure 2, where the functions F satisfy the following relations:

    F(R, B; R_o, B_o) = \frac{B}{R + B + 1} F(R + 1, B; R_o, B_o) + \frac{R}{R + B + 1} F(R, B + 1; R_o, B_o)

FIGURE 2.  Probability that exactly R and B are surviving at time t.


19.  Examples of specific solutions are the following:

a.  Initial Red force: 6 platoons
    Initial Blue force: 3 platoons

    Probability      Time required to reduce Blue force by 65%
                     (t minutes or less)

       .63                  7
       .56                  5
       .42                  3
       .16                  1

b.  Initial Red force: 4 platoons
    Initial Blue force: 2 platoons

    Probability      Time required to reduce Blue force by 65%
                     (t minutes or less)

       .42                  7
       .32                  5
       .19                  3
       .06                  1
SUPERPOSITION OF SOLUTIONS IN A MULTIPOINT BOUNDARY VALUE PROGRAM

ABSTRACT.  A shooting method is the superposition of initial value solutions
of ordinary differential equations such that boundary conditions are "met" or
a performance index is minimized.

The results of meeting noisy boundary conditions in least squares and minimax
norms are presented. The example problem is a damped, forced harmonic oscillator.

The procedures are basic to system identification problems.

1.  INTRODUCTION.  The linear boundary value problem is governed by the
ordinary differential equation

    \dot{y} = Ly + f                                            (1)

where y and \dot{y} are, respectively, the vector of n state variables and its
derivative, L and f are matrix and vector functions of the independent variable
t, time. The solution of this differential equation is subject to a set of
boundary conditions

    q_i(y(t_i)) = b_i,    i = 1, 2, ..., m                      (2)

where q_i is the boundary condition operator that specifies a linear combination
of the state variables equal to the boundary value, b_i, at time, t_i.

A shooting method is to superimpose appropriately independent solutions of
(1). This can be written as

    y = \sum_{j=0}^{n} a_j p^{(j)}                              (3)

where p^{(j)} is a particular solution of (1) and a_j is the corresponding
superposition constant.

The independence properties can be assured by the following strategy. Assume
p^{(0)}(0) = \alpha. Then take

    p_i^{(j)}(0) = \alpha_i + \delta_{ij} \beta_j,    i, j = 1, 2, ..., n      (4)

where \delta is the Kronecker delta and all \beta_j \ne 0. The above strategy
gives a determinant (Wronskian) of the associated homogeneous differential
equations, at t=0, of the product of the \beta's.

The superposition of particular solutions also requires that

    \sum_{j=0}^{n} a_j = 1.                                     (5)

Childs et al. [1970] give more details on the strategy and a proof of (5). If
\alpha is the initial value vector that makes (1) satisfy (2) then it is obvious
that a_0 = 1 and a_1 = a_2 = ... = a_n = 0. How close the actual superposition
constants come to these values is an indication of the merit of the numerical
method.

The superposition (3) is substituted into the boundary conditions (2). If the
boundary conditions are linear in y, then the operators q and \Sigma may be
interchanged giving

    \sum_{j=0}^{n} [q_i(p^{(j)}(t_i))] a_j = b_i,    i = 1, 2, ..., m.      (6)

The use of shooting procedures results in the particular solutions, p^{(j)},
being known and we observe that the bracketed terms in (6) are simply
coefficients of an algebraic equation in the unknown superposition constants.
We write (5) and (6) as the matrix equation

    Sa = d                                                      (7)

where

    s_{0,j} = 1,    d_0 = 1

and

    s_{i,j} = q_i(p^{(j)}(t_i)),    d_i = b_i,    i = 1, 2, ..., m;  j = 0, 1, ..., n.      (8)

2.  NONLINEARITIES.  If the differential equation is nonlinear then we may
write it as

    \dot{y} = g(y, t).                                          (9)

Equation (9) is linearized via a Taylor series expansion in order to obtain an
equation linear in z:

    \dot{z} = g(w, t) + [\partial g / \partial y]_{y=w} (z - w)      (10)

where w is a reference solution or a previous approximation to y. Equation (10)
may be rewritten as

    \dot{z} = Jz + g'                                           (11)

where

    J = [\partial g / \partial y]_{y=w}    and    g' = g(w, t) - Jw.      (12)

Therefore, if we are given a nonlinear differential equation subject to
boundary conditions, we may approximate it by the linear equation

    \dot{z} = Jz + g'                                           (13)

subject to the boundary conditions

    q_i(z(t_i)) = b_i,    i = 1, 2, ..., m.                     (14)

It is obvious that (13) and (14) are analogous to (1) and (2) and so we proceed
in the same fashion. The only difference is that now the solution is obtained
iteratively.

We again superimpose n+1 particular solutions of (13) that meet the boundary
conditions (14):

    z = \sum_{j=0}^{n} a_j p^{(j)}                              (15)

where each p^{(j)} satisfies

    \dot{p}^{(j)} = J p^{(j)} + g'.                             (16)

The superposition constants are determined by

    \sum_{j=0}^{n} a_j = 1                                      (17)

and

    q_i \{ \sum_{j=0}^{n} p^{(j)}(t_i) a_j \} = b_i,    i = 1, 2, ..., m.      (18)

If the operators, q_i, are linear then (17) and (18) form a set of linear
equations analogous to (7).

If any of the boundary condition operators, q_i, are nonlinear, then they must
also be linearized by a Taylor series expansion. The linearization is done with
respect to the superposition constants with the initial reference values of the
vector, a, as

    a_0 = 1,    a_j = 0,    j = 1, 2, ..., n.                   (19)

The reader can refer to Childs et al. [1970] and to Roberts and Shipman [5]
for more details on these linearization procedures.

3.  OVERDETERMINED BOUNDARY CONDITIONS.  If the number of boundary conditions,
m, is greater than the order of the differential equation, n, then (7)
constitutes an overdetermined set of linear equations with the a_j's unknown.

Not all of the equations can be met exactly; some will have to be met in a
"best fit" sense. Let's assume that p of the m boundary conditions are to be
met exactly, then p+1 of the m+1 equations (the superposition condition is
included) must be met exactly. Equation (7) may be partitioned as follows:

    \begin{bmatrix} S_1 & S_2 \\ S_3 & S_4 \end{bmatrix} \begin{bmatrix} a_e \\ a_o \end{bmatrix} = \begin{bmatrix} d_e \\ d_o \end{bmatrix}      (20)

The components S_1, S_2, and d_e correspond with the equations to be met exactly.
By suitable matrix operations, (20) can be transformed into

    \begin{bmatrix} I & S_2' \\ 0 & S_4' \end{bmatrix} \begin{bmatrix} a_e \\ a_o \end{bmatrix} = \begin{bmatrix} d_e' \\ d_o' \end{bmatrix}      (21)

Two matrix equations result from (21):

    S_4' a_o = d_o'                                             (22)

    a_e = d_e' - S_2' a_o                                       (23)

Equation (22) is solved in a "best fit" sense for a_o, which is then substituted
into (23) for a_e.

Once the superposition constants, a_j, are found, they are multiplied by
their appropriate particular solutions at t=0, that is, p^{(j)}(0), which yields
an estimate of y(0), i.e.

    y(0) = \sum_{j=0}^{n} p^{(j)}(0) a_j.                       (24)

If the problem is nonlinear, the LHS of (24) is taken as the unperturbed
particular solution at t=0, p^{(0)}(0). Independent perturbed solutions are
generated by the strategy described in (4) and a new set of superposition
constants found. The method is repeated until convergence of consecutive
p^{(0)}(0) vectors is observed (i.e., a_0 will approach unity and all other
a_j's will approach zero).

There are two principal methods of solving overdetermined systems of
linear equations. They are:

1)  least squares solution

2)  minimax or Chebyshev solution.

a.  Least-Squares Solution:  Given

    S_4' a_o = d_o'                                             (25)

the residual vector can be written,

    R = S_4' a_o - d_o'.                                        (26)

The least-squares solution is the vector, a_o, that minimizes the sum of the
squares of the components of the residual vector, R; it is:

    a_o = (S_4'^T S_4')^{-1} S_4'^T d_o'.                       (27)

This is substituted back into (23) to find a_e.

b.  Minimax Solution:  The minimax solution is the vector a_o which minimizes
the largest absolute value of the components of the residual vector (26). That
is, we want to minimize \max_i |R_i|. The method advanced by Powell is
used in the program. See [3] and [4] for more specifics on the minimax method.

4.  RESULTS.  We considered the following problem

    \ddot{x} + \mu \dot{x} + \xi x = \sin(t)                    (28)

which is the equation of motion for a forced, damped harmonic oscillator.

By the change of variables

    x = y_1,    \dot{x} = y_2,    \mu = y_3,    \xi = y_4       (29)

(28) may be replaced by

    \dot{y}_1 = y_2
    \dot{y}_2 = -y_4 y_1 - y_3 y_2 + \sin(t)
    \dot{y}_3 = 0
    \dot{y}_4 = 0.                                              (30)

Initial values were selected for the state variables and solutions for
y_1 and y_2 were generated on the interval spanned by the times t_i. At times
t_i, i = 1, 2, ..., 15, the value of y_2 was observed. These values were taken
as the exact boundary values (to 8 significant figures). Six sets of "noisy"
data were produced by two techniques. The first was to round off the exact
boundary values to 1, 2, and 3 decimal places to the right of the decimal
point. The second technique was to add a random variable that was normally
distributed with a mean of zero and a standard deviation, \sigma. Three
different values of \sigma were used: .01, .1, and .5. See Table 1 for the sets
of boundary values. The program was run using each set of the corrupted
boundary values as data. The errors between the originally selected initial
conditions and those that the computer estimated from the noisy data were
computed. Both least-squares and minimax were employed to solve the
overdetermined system, for each data set. See Tables 2, 3, 4 and 5. Two
criteria were chosen as the basis for evaluating the closeness of fit: (1) the
sum of the absolute values of the errors and (2) the sum of the squares of the
errors. In all cases, the least-squares solution proved the better fit, as
expected. The accuracy of the parameter estimates is impressive, even with the
noisiest data.
 t_i    Boundary values     Boundary values     Boundary values
        rounded to 3        rounded to 2        rounded to 1
        decimal places      decimal places      decimal place

  1       -0.220              -0.22               -0.2
  2        0.35D-01            0.3D-01             0.0
  3       -0.474              -0.47               -0.5
  4       -0.589              -0.59               -0.6
  5        0.393               0.39                0.4
  6        1.597               1.60                1.6
  7        1.452               1.45                1.5
  8       -0.388              -0.39               -0.4
  9       -2.324              -2.32               -2.3
 10       -2.274              -2.27               -2.3
 11        0.88D-01            0.9D-01             0.1D-00
 12        2.711               2.71                2.7
 13        2.997               3.00                3.0
 14        0.401               0.40                0.4
 15       -2.816              -2.82               -2.8

 t_i    Boundary values     Boundary values     Boundary values
        with N(0,.01)       with N(0,.1)        with N(0,.5)
        r.v. added          r.v. added          r.v. added

  1      -0.21869550         -0.20667165         -0.15323232
  2       0.37798268D-01      0.66064361D-01      0.19169151
  3      -0.46666120         -0.40497946         -0.13083809
  4      -0.59655859         -0.66370844         -0.96215247
  5       0.39299204          0.39030898          0.37838423
  6       1.5960593           1.5846639           1.5340173
  7       1.4712304           1.6443249           2.4136342
  8      -0.38313658         -0.33635166         -0.12841859
  9      -2.3169182          -2.2529945          -1.9688892
 10      -2.2631763          -2.1656611          -1.7322599
 11       0.91826552D-01      0.12353162          0.26444312
 12       2.6876393           2.4736753           1.5227239
 13       3.0007328           3.0323136           3.1726727
 14       0.39058453          0.29953824         -0.10511233
 15      -2.8133056          -2.7851188          -2.6598437

TABLE 1
                True value      I.C. estimates using       I.C. estimates using
                of initial      Least Squares. B.V.'s      Minimax. B.V.'s input
                conditions      input with 8 signifi-      with 8 significant
                                cant figures               figures

X(0)            1.0             0.99999999                 0.99999997
\dot{X}(0)      0.5             0.49999999                 0.49999998
\mu             0.2             0.20000000                 0.20000000
\xi             1.0             1.00000000                 1.00000000

\sum |e_i|                      [illegible]                [illegible]
\sum e_i^2                      2.0D-16                    1.0D-15

TABLE 2
                I.C. estimates using       I.C. estimates using
                Least Squares. B.V.'s      Minimax. B.V.'s are
                are rounded to 3           rounded to 3 decimal
                decimal places             places

X(0)            1.0000524                  0.99976973
\dot{X}(0)      0.49978310                 0.49976451
\mu             0.20004417                 0.20009774
\xi             0.99996730                 0.99998927

\sum |e_i|      3.4617D-04                 5.7423D-04
\sum e_i^2      5.2812D-08                 11.8148D-08

                I.C. estimates using       I.C. estimates using
                Least Squares. B.V.'s      Minimax. B.V.'s are
                are rounded to 2           rounded to 2 decimal
                decimal places             places

X(0)            1.0011527                  1.0008828
\dot{X}(0)      0.50086499                 0.50155621
\mu             0.19989372                 0.19996940
\xi             1.0001414                  0.99988326

\sum |e_i|      2.2654D-03                 2.5864D-03
\sum e_i^2      2.1082D-06                 3.2157D-06

TABLE 3
                I.C. estimates using       I.C. estimates using
                Least Squares. B.V.'s      Minimax. B.V.'s are
                are rounded to 1           rounded to 1 decimal
                decimal place              place

X(0)            0.95849170                 0.93474852
\dot{X}(0)      0.50061069                 0.48097522
\mu             0.20243731                 0.20584127
\xi             0.99890736                 0.99848235

\sum |e_i|      4.5649D-02                 9.1635D-02
\sum e_i^2      1.7304D-03                 4.6561D-03

                I.C. estimates using       I.C. estimates using
                Least Squares. N(0,.01)    Minimax. N(0,.01) r.v.
                r.v. added to the          added to the B.V.'s
                B.V.'s

X(0)            1.0009066                  0.99134015
\dot{X}(0)      0.49193995                 0.48700959
\mu             0.20106603                 0.20179481
\xi             1.0003406                  0.99973471

\sum |e_i|      9.7755D-03                 23.7104D-03
\sum e_i^2      6.7049D-05                 24.7035D-05

TABLE 4


                I.C. estimates using       I.C. estimates using
                Least Squares. N(0,.1)     Minimax. N(0,.1) r.v.
                r.v. added to the          added to the B.V.'s
                B.V.'s

X(0)            1.0112150                  0.91127941
\dot{X}(0)      0.41910446                 0.36854682
\mu             0.21071785                 0.21777510
\xi             1.0033087                  0.99682169

\sum |e_i|      .1061                      .2411
\sum e_i^2      6.7957D-03                 2.5477D-02

                I.C. estimates using       I.C. estimates using
                Least Squares. N(0,.5)     Minimax. N(0,.5) r.v.
                r.v. added to the          added to the B.V.'s
                B.V.'s

X(0)            1.1020264                  0.49280864
\dot{X}(0)      0.90461984D-01            -0.18505768
\mu             0.25470622                 0.28377993
\xi             1.0143196                  0.97314194

\sum |e_i|      .5809                      1.3029
\sum e_i^2      .1813                      .7343

TABLE 5
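
The least-squares branch of the procedure can be illustrated on a reduced version of the test problem. The sketch below is not the program used for Tables 2 through 5: it assumes the oscillator has the form \ddot{x} + \mu\dot{x} + \xi x = sin(t), treats \mu and \xi as known so that the problem stays linear, generates its own synthetic velocity observations at t = 1, ..., 15 rounded to 3 decimal places, and estimates only x(0) and \dot{x}(0). The superposition constraint is met exactly by eliminating a_0, and the remaining constants come from the 2 by 2 normal equations.

```python
import math

MU, XI = 0.2, 1.0   # treated as known in this sketch (an assumption)

def deriv(t, y):
    return [y[1], -XI * y[0] - MU * y[1] + math.sin(t)]

def velocities(y0, times, steps_per_unit=200):
    """RK4-integrate from t = 0 and return y_2 at each requested time."""
    h = 1.0 / steps_per_unit
    t, y, out = 0.0, list(y0), []
    for t_obs in times:
        for _ in range(int(round((t_obs - t) / h))):
            k1 = deriv(t, y)
            k2 = deriv(t + h / 2, [yi + h / 2 * k for yi, k in zip(y, k1)])
            k3 = deriv(t + h / 2, [yi + h / 2 * k for yi, k in zip(y, k2)])
            k4 = deriv(t + h, [yi + h * k for yi, k in zip(y, k3)])
            y = [yi + h / 6 * (s1 + 2 * s2 + 2 * s3 + s4)
                 for yi, s1, s2, s3, s4 in zip(y, k1, k2, k3, k4)]
            t += h
        out.append(y[1])
    return out

times = list(range(1, 16))
truth = [1.0, 0.5]
b = [round(v, 3) for v in velocities(truth, times)]   # synthetic rounded data

alpha, beta = [0.0, 0.0], [1.0, 1.0]                  # reference guess, perturbations
starts = [alpha[:],
          [alpha[0] + beta[0], alpha[1]],
          [alpha[0], alpha[1] + beta[1]]]
s = [velocities(p0, times) for p0 in starts]          # s[j][i] = q_i(p^(j)(t_i))

# Meet sum(a_j) = 1 exactly by substituting a_0 = 1 - a_1 - a_2,
# then solve the reduced system for (a_1, a_2) in the least-squares sense.
c1 = [s1 - s0 for s0, s1 in zip(s[0], s[1])]
c2 = [s2 - s0 for s0, s2 in zip(s[0], s[2])]
rhs = [bi - s0 for bi, s0 in zip(b, s[0])]
n11 = sum(u * u for u in c1)
n12 = sum(u * v for u, v in zip(c1, c2))
n22 = sum(v * v for v in c2)
g1 = sum(u * r for u, r in zip(c1, rhs))
g2 = sum(v * r for v, r in zip(c2, rhs))
det = n11 * n22 - n12 * n12
a1 = (g1 * n22 - g2 * n12) / det
a2 = (n11 * g2 - n12 * g1) / det
a0 = 1.0 - a1 - a2
estimate = [alpha[0] + beta[0] * a1, alpha[1] + beta[1] * a2]   # y(0) as in (24)
print(a0, a1, a2, estimate)
```

With data rounded to 3 places the recovered initial conditions should agree with the selected values to roughly the same number of places, in line with the behavior reported for the full four-parameter problem.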
ABSTRACT.  An extension of regression analysis from the
usual algebraic models to differential equation models is given.

A shooting method, superposition of appropriately independent
initial value solutions of differential equations, is used. The
shooting method used is based on particular solutions of the
governing differential equations. Nonlinear differential equations
and/or boundary conditions can be accommodated.

The statistics of linear regression are generated through
a straightforward analysis of variance. These provide the basis
of "acceptance" or "rejection" of the regression.

The statistics generated include an (uncorrected) ANOVA
table, general F-test on the regression, R^2 value, the coefficient
of variation, covariance matrix of the superposition constants,
estimate of the variance about the regression, estimate of the
variance of the parameters, and the confidence intervals of these
estimates.

The procedures are basic to system identification problems.
1.  INTRODUCTION.  The linear boundary value problem is governed by the
ordinary differential equation

    \dot{y} = Ly + f                                            (1)

where y and \dot{y} are, respectively, the vector of n state variables and its
derivative, L and f are matrix and vector functions of the independent variable
t, time. The solution of this differential equation is subject to a set of
boundary conditions

    q_i(y(t_i)) = b_i,    i = 1, 2, ..., m                      (2)

where q_i is the boundary condition operator that specifies a linear combination
of the state variables equal to the boundary value, b_i, at time, t_i. We
are concerned only with those cases where m>n and the boundary conditions are
to be met in a least squares sense.

A shooting method is to superimpose appropriately independent solutions of
equation (1). This can be written as

    y = \sum_{j=0}^{n} a_j p^{(j)}                              (3)

where p^{(j)} is a particular solution of (1) and a_j is the corresponding
superposition constant.

The independence properties can be assured by the following strategy.
Assume p^{(0)}(0) = \alpha. Then take

    p_i^{(j)}(0) = \alpha_i + \delta_{ij} \beta_j,    i, j = 1, 2, ..., n      (4)

where \delta is the Kronecker delta and all \beta_j \ne 0. The above strategy
gives a determinant (Wronskian) of the associated homogeneous differential
equations, at t=0, of the product of the \beta's.

The superposition of particular solutions also requires that

    \sum_{j=0}^{n} a_j = 1.                                     (5)

Childs et al. [1970] give more details on the strategy and proof of (5).
If \alpha is the initial value vector that makes (1) satisfy (2) then it is
obvious that a_0 = 1 and a_1 = a_2 = ... = a_n = 0. How close the actual
superposition constants come to these values is an indication of the merit of
the numerical method.

The superposition (3) is substituted into the boundary conditions (2).
If the boundary conditions are linear in y, then the operators q and \Sigma may
be interchanged giving

    \sum_{j=0}^{n} [q_i(p^{(j)}(t_i))] a_j = b_i,    i = 1, 2, ..., m.      (6)

The use of shooting procedures results in the particular solutions, p^{(j)},
being known and we observe that the bracketed terms in (6) are simply
coefficients of an algebraic equation in the unknown superposition constants.
We write (5) and (6) as the matrix equation

    Sa = d                                                      (7)

where

    s_{0,j} = 1,    d_0 = 1

and

    s_{i,j} = q_i(p^{(j)}(t_i)),    d_i = b_i,    i = 1, 2, ..., m;  j = 0, 1, ..., n.

2.  THE OVER-DETERMINED SYSTEM.  The solution of the system (7) is
easily obtained for the data sets we have considered. We rewrite (7)
partitioning (and rearranging if necessary) the elements of the vectors and
matrices

    \begin{bmatrix} S_1 & S_2 \\ S_3 & S_4 \end{bmatrix} \begin{bmatrix} a_e \\ a_l \end{bmatrix} = \begin{bmatrix} d_e \\ d_l \end{bmatrix}      (8)

The equality sign is understood to mean "equality" (as much as our numerical
procedures allow) for the upper portion of (8) and "least squares" fit for
the lower portion. The equality conditions come from the superposition
constraint (5) and any boundary conditions that may exist which should be met
"exactly." Weighting of the rest of the boundary conditions can be done
but is not shown.

A straightforward method of solution is by elementary operations
(Gaussian reduction with maximum pivot selection) to transform (8) into the
equality (9) and the "least square" fit (10)

    I a_e + 'S_2 a_l = 'd_e                                     (9)

    'S_4 a_l = 'd_l                                             (10)

The '( ) denotes the values have been affected by the reduction process. Note
that 'S_1 = I and 'S_3 = 0.

The least square solution for a_l in (10) is obtained in the usual manner;
the normal equations result from premultiplying by the transpose of 'S_4. The
result is substituted into (9) to obtain the rest of the a vector.

The "correct" initial value vector can be calculated from

    y(0) = p^{(0)}(0) + Ba                                      (11)

where B is a diagonal matrix with indices varying from 0 through n: B_{00} = 0,
B_{jj} = \beta_j, j = 1, 2, ..., n. Our computational procedure is to repeat the
process with p^{(0)}(0) = y(0) from (11) such that we should have

    a_j = 0,    j = 1, 2, ..., n.                               (12)

This will aid in construction of confidence limits of parameter estimates.
This also gives a convenient quantity

    \hat{q}_i = q_i(p^{(0)}(t_i))                               (13)

which is the "predicted" boundary value.
3.  ANALYSIS OF VARIANCE.  An uncorrected ANOVA table is presented below
in terms of the nomenclature introduced. The significant calculations are
presented in terms of vector products. These products are over the \hat{q} and
b vectors and are over elements associated with least squares boundary
conditions only; any exact boundary conditions are ignored in these products.

TABLE 1

ANOVA TABLE

Source                              Sum of Squares          Degrees of Freedom    Mean Square

Due to regression                   \hat{q}^T b             n                     SS/n
About the regression (residual)     b^T b - \hat{q}^T b     m-k-n                 s^2
Total (uncorrected)                 b^T b                   m-k

Notice in the degrees of freedom column that m is the total number of boundary
conditions and k of those are to be met exactly.

We define \bar{b} to be the mean of the least square boundary values

    \bar{b} = (\sum_i b_i)/(m-k)                                (14)

The  following  formulae  are  used  to  calculate  the  usual  statistics: 

^  (15) 

(m-k) 

    s^2 = MS (residual) = estimated variance of system          (16)

    F_{cal} = MS (regression) / s^2                             (17)
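
The entries of the uncorrected ANOVA table reduce to a few vector products. The sketch below assumes the sums of squares are \hat{q}^T b (regression), b^T b - \hat{q}^T b (residual) and b^T b (total), as in the table above; the data are hypothetical, taken from a one-parameter least-squares fit through the origin so that n = 1 and there are no exact conditions (k = 0).

```python
def uncorrected_anova(b, q_hat, n):
    """Uncorrected ANOVA from observed (b) and predicted (q_hat) boundary values.

    b and q_hat hold only the least-squares boundary conditions, so len(b) = m - k.
    """
    dof_total = len(b)
    ss_total = sum(bi * bi for bi in b)
    ss_reg = sum(qi * bi for qi, bi in zip(q_hat, b))
    ss_res = ss_total - ss_reg
    ms_reg = ss_reg / n
    s2 = ss_res / (dof_total - n)          # estimated variance of system, eq. (16)
    return {"ss_reg": ss_reg, "ss_res": ss_res, "ss_total": ss_total,
            "ms_reg": ms_reg, "s2": s2, "f_cal": ms_reg / s2}   # eq. (17)

# Hypothetical data: fit b ~ c*x through the origin by least squares.
x = [1.0, 2.0, 3.0, 4.0]
b = [1.1, 1.9, 3.2, 3.9]
c = sum(xi * bi for xi, bi in zip(x, b)) / sum(xi * xi for xi in x)
q_hat = [c * xi for xi in x]

table = uncorrected_anova(b, q_hat, n=1)
print(table)
```

For a true least-squares fit the residual is orthogonal to the predicted values, so the residual sum of squares computed this way equals the sum of squared differences between b and \hat{q}.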
In (15) the summation and product are over the least square boundary conditions.
The F_{cal} value must exceed a Fisher's F with:

    Probability of 1 - \alpha (\alpha is the producer's risk)

    Numerator degrees of freedom n

    Denominator degrees of freedom m-k-n

for the regression to be accepted.

The estimated variances of the boundary values are:

    est. var. (\hat{q}_i) = s^2 ('S_4)_i [('S_4)^T ('S_4)]^{-1} ('S_4)_i^T      (18)

where the i subscript denotes the ith row of the matrix. The resulting
confidence limits in terms of the t statistic are:

    \hat{q}_i \pm t(\nu, 1 - \alpha/2) \sqrt{est. var. (\hat{q}_i)}      (19)

where \nu = m-k-n and \alpha is the producer's risk.

We  have  stated  these  procedures  are  basic  to  system  identification 
procedures.  We  are  most  interested  in  y(0)  and  its  covariance  in  those  cases. 


Recall  equations  (9)  and  (10)  and  we  denote 
=  ('d^)^('d^)/(m-n) 

The  covariance  matrix  of  is 

Likewise,  the  covariance  matrix  of  a  is 

e 

The  final  covariance  matrix  is  formed  by  appropriate  multiplying  by  the 
perturbation  matrix  B  giving 


(20) 

(21) 

(22) 


The zeroth row and column of the result are null, reflecting the zero variance
of the superposition constraint (5). The ith diagonal element is the estimated
variance of y_i(0). Its square root is the estimated standard deviation of
y_i(0). The confidence limits (which Draper and Smith point out should be
viewed with caution) are

    y_i(0) \pm t(m-n, 1-\alpha/2) [est. std. dev.]_i            (24)

A more stringent confidence limit would be a hyperellipsoid like

    e^T (C_y)^{-1} e \le n F(n, m-n, 1-\alpha)                  (25)

where C_y is the covariance matrix of y(0) and the vector e is within the
confidence limits about y(0). This kind of statement is difficult to use if n>2.


4.  AN EXAMPLE.  Consider the problem of determining the coasting dynamics
of an automobile. The three force elements of a model of the phenomenon are [5]

a.  Rolling friction due to tire flexing and Coulomb friction on
gear train.

b.  Aerodynamic resistance proportional to the square of velocity.

c.  Product of mass and deceleration.

The differential equation may be written as

    \ddot{x} + \frac{\rho A_f C_d}{2M} (\dot{x})^2 + \mu_r g = 0      (26)

where

    \rho = air density (slugs/ft^3)
    A_f = frontal area of vehicle (ft^2)
    M = mass of vehicle (slugs)
    g = acceleration of gravity (ft/sec^2)
    C_d = coefficient of drag
    \mu_r = rolling friction coefficient
    \dot{x} = velocity (ft/sec)
    \ddot{x} = acceleration (ft/sec^2)

Since the displacement, x, does not appear in (26), we can write this as a
first order ordinary differential equation (substituting y for \dot{x})

    \dot{y} + \frac{\rho A_f C_d}{2M} y^2 + \mu_r g = 0          (27)

The most economical procedure to obtain sufficient measurements would be to
coast an automobile in neutral from some speed, say 80 miles/hour, and
record the speed at intervals of, say, 5 seconds. The boundary values in
Figure 1 are velocities in ft/sec at 5 second intervals. Since (27) is
nonlinear, the usual Newton type linearization procedures are employed; see
Walker [4] in these proceedings or Childs [1], [2] for more details.

Figure 2 is the output of our program which is based on (27), the data in
Figure 1, parameter values common in engineering use, and automobile parameters
from the Road and Track Test Annual for 1966 for a Sunbeam Alpine. The
coefficient of drag C_d = y_2 = 0.5025 ± 0.0690 and coefficient of rolling
friction \mu_r = y_3 = 0.0169 ± 0.0031 resulted.
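
To give a feel for the coast-down curve implied by these estimates, the sketch below integrates the first order equation forward from 80 miles/hour. It assumes (27) has the form \dot{y} = -(\rho A_f C_d / 2M) y^2 - \mu_r g; the drag and rolling coefficients are the estimates quoted above, the air density is the standard sea-level value, and the frontal area and mass are hypothetical stand-ins, since the vehicle data used in the paper are not listed here.

```python
RHO = 0.002378      # air density, slugs/ft^3 (standard sea level)
G = 32.174          # acceleration of gravity, ft/sec^2
C_D = 0.5025        # estimated coefficient of drag (from the text)
MU_R = 0.0169       # estimated rolling friction coefficient (from the text)
AREA = 18.0         # frontal area, ft^2 (hypothetical)
MASS = 70.0         # vehicle mass, slugs (hypothetical)

def decel(v):
    """Right-hand side of the coasting equation: dv/dt as a function of v."""
    return -(RHO * AREA * C_D / (2.0 * MASS)) * v * v - MU_R * G

def coast(v0, t_end, h=0.01):
    """Speed after coasting for t_end seconds from v0, by classical RK4."""
    v = v0
    for _ in range(int(round(t_end / h))):
        k1 = decel(v)
        k2 = decel(v + h / 2 * k1)
        k3 = decel(v + h / 2 * k2)
        k4 = decel(v + h * k3)
        v += h / 6 * (k1 + 2 * k2 + 2 * k3 + k4)
    return v

v0 = 80.0 * 5280.0 / 3600.0                        # 80 miles/hour in ft/sec
speeds = [coast(v0, 5.0 * i) for i in range(7)]    # every 5 seconds up to 30 s
for i, v in enumerate(speeds):
    print(5 * i, round(v, 2))
```

Speeds sampled in this way at 5 second intervals play the role of the boundary values b_i in the regression.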

5.  CONCLUSIONS.  Regression  analysis  with  differential  equation  models 
is  feasible.  It  could  significantly  affect  design  of  experiments  when  such 
models  are  relevant.  Determination  of  the  parameters  in  the  simple  example 
would  be  expensive  if  one  had  to  use  wind  tunnels  or  treadmills. 
FIGURE 1.  Boundary values (velocities in ft/sec at 5 second intervals).

FIGURE 2.  Program output of the post statistical study.


MULTI-LEVEL  ADAPTIVE  SOLUTIONS  TO 
BOUNDARY-VALUE PROBLEMS*


Achi  Brandt 

Weizmann  Institute  of  Science,  Rehovot,  Israel 
IBM  Thomas  J.  Watson  Research  Center,  Yorktown  Heights,  New  York  10598 

ABSTRACT.  The boundary-value problem is discretized on several grids
(or finite-element spaces) of widely different mesh sizes. Interactions
between these levels enable us (i) to solve the possibly nonlinear system
of n discrete equations in O(n) operations (40n additions and shifts for
Poisson problems); (ii) to conveniently adapt the discretization (the
local mesh size, local order of approximation, etc.) to the evolving
solution in a nearly optimal way, obtaining "∞-order" approximations
and low n, even when singularities are present. General theoretical analysis
of the numerical process. Numerical experiments with linear and nonlinear,
elliptic and mixed-type (transonic flow) problems confirm theoretical
predictions.


1.  Introduction.

In most numerical procedures for solving partial differential equations,
the analyst first discretizes the problem, choosing approximating algebraic
equations on a finite dimensional approximation space, and then devises a
numerical process to (nearly) solve this huge system of discrete equations.
Usually, no real interplay is allowed between discretization and solution
processes. This results in enormous waste: The discretization process,
being unable to predict the proper resolution and the proper order of
approximation at each location, produces a mesh which is too fine. The
algebraic system thus becomes unnecessarily large in size, while accuracy
usually remains rather low, since local smoothness of the solution is not
being properly exploited. On the other hand, the solution process fails
to take advantage of the fact that the algebraic system to be solved does
not stand by itself, but is actually an approximation to continuous equations,
and therefore can itself be approximated by other (much simpler) algebraic
systems.

The purpose of the work reported here is to study how to intermix
discretization and solution processes, thereby making both of them
orders-of-magnitude more effective. The method to be proposed is not
"saturated", that is, accuracy grows indefinitely as computations proceed.
The rate of convergence (overall error E as function of computational
work W) is in principle of "infinite order", e.g., $E \sim \exp(-\beta W/d)$
for a d-dimensional problem which has a solution with scale-ratios $\beta > 0$,
or $E \sim \exp(-W^{1/2})$ for problems with arbitrary thin layers (see Sec. 9).

* The research reported here was partly supported by the Israel Commission
for Basic Research. Part of the research was conducted at the Institute
for Computer Applications in Science and Engineering (ICASE), NASA
Langley Research Center, Hampton, Virginia.

The basic idea of the Multi Level Adaptive Techniques (MLAT) is to
work not with a single grid, but with a sequence of grids ("levels") of
increasing fineness, which may be introduced and changed in the process, and
which constantly interact with each other. For description purposes, it
is convenient to regard this technique as composed of two main concepts:

(1) The Multi-Grid (MG) method for solving discrete equations. This
method iteratively solves a system of discrete (finite-difference or
finite-element) equations on a given grid, by constant interactions with a
hierarchy of coarser grids, taking advantage of the relation between different
discretizations of the same continuous problem. This method can be
viewed in two complementary ways: One is to view the coarser grids as
correction grids, accelerating convergence of a relaxation scheme on the
finest grid by efficiently liquidating smooth error components.
(See general description in Sec. 2 and algorithm in Sec. 4.) Another point
of view is to regard finer grids as the correction grids, improving
accuracy on coarser grids by correcting their forcing terms. The latter
is a very useful point of view, making it possible to manipulate
accurate solutions on coarser grids, with only infrequent "visits" to
pieces of finer levels. (This is the basis for the multi-grid treatment
of non-uniform grids; cf. Secs. 7.2 and 7.5. The FAS mode for nonlinear
problems and the adaptive procedures stem from this viewpoint.) The two
seemingly different approaches actually amount to the same algorithm (in
the simple case of "coextensive" levels).

The multi-grid process is very efficient: A discrete system of n
equations (n points in the finest grid) is solved, to the desired accuracy,
in O(n) computer operations. If P parallel processors are available,
the required number of computer steps is O(n/P + log n). For example, only
40n additions and shifts are required for solving the 5-point Poisson
equation on a grid with n points (see Sec. 6.3). This efficiency does
not depend on the shape of the domain, the form of the boundary conditions,
or the mesh-size, and is not sensitive to choice of parameters. The memory
area required is essentially only the minimal one, that is, the storage of
the problem and the solution. In fact, if the amount of numerical data is
small and only few functionals of the solution are wanted, the required
memory is only O(log n), with no need for external memory (see Sec. 7.5).

Multi-grid algorithms are not difficult to program, if the various
grids are suitably organized. We give an example (Appendix B) of a
FORTRAN program, showing the typical structure, together with its computer
output, showing the typical efficiency. With such an approach, the
programming of any new multi-grid problem is basically reduced to the
programming of a usual relaxation routine. The same is true for nonlinear
problems, where no linearization is needed, due to the FAS (Full
Approximation Storage) method introduced in Sec. 5.


Multi-grid solution times can be predicted in advance; a
recipe is given and compared with numerical tests (Sec. 6). The basic
tool is the local mode (Fourier) analysis, applied to the locally
linearized-frozen difference equations, ignoring far boundaries. Such
an analysis yields a very good approximation to the behavior of the
high-frequency error modes, which are exactly the only significant modes
in the multi-grid process, since the low-frequency error modes are
liquidated in the coarse-grids processing, with negligible amounts of
computational work. Thus, mode analysis gives a very realistic prediction
of convergence rates per unit work. (For model problems, the analysis can be
made rigorous; see Appendix C.) The mode analysis can, therefore, be
used to choose suitable relaxation schemes (Sec. 3) and suitable criteria
for switching and interpolating between the grids (Appendix A). Our
numerical tests ranged from simple elliptic problems to nonlinear mixed-type
(transonic flow) problems, which included hyperbolic regions and
discontinuities (shocks). The results show that, as predicted by the
mode analysis, errors are reduced by an order of magnitude (factor 10)
expending computational work equivalent to 4 to 5 relaxation sweeps on
the finest grid.

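(2) Adaptive discretization. Mesh-sizes, orders of approximation
and other discretization parameters are treated as spatial variables.
Using certain general internal criteria, these variables are controlled
in a suboptimal way, adapting themselves to the computed solution. The
criteria are devised to obtain maximum overall accuracy for a given
amount of calculations. (In practice only near-optimality should be
attempted, since otherwise the required control would become more costly
than the actual computations.) The resulting discretization will
automatically resolve thin layers, refine meshes near singular points
(that otherwise may "contaminate" the whole solution), exploit local
smoothness of solutions (in proper scale), etc. (see Sec. 9).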

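Multi-grid processing and adaptive discretization can be used
independently of each other, but their combination is very fruitful. MG
is the only fast (and convenient) method to solve discrete equations on
the non-uniform grids typically produced by the adaptive procedure. Its
iterative character fits well into the adaptive process. In particular,
an efficient and very flexible general way to construct an adaptive grid
is as a sequence of uniform subgrids, the same sequence used in the
multi-grid process, so that difference equations need be constructed
only on equidistant points, the subgrids interacting with each other
through the multi-grid process (Sec. 7).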

[...] problems, etc., and to finite-element discretization. The latter
is briefly discussed in Secs. A.5 and 7.3.

2. MULTI-GRID PHILOSOPHY.

Suppose we have a set of grids $G^0, G^1, \ldots, G^M$, all approximating the
same domain $\Omega$ with corresponding mesh-sizes $h_0 > h_1 > \cdots > h_M$.
For simplicity one can think of the familiar uniform square grids, with the
mesh-size ratio $h_{k+1} : h_k = 1 : 2$. Suppose further that a differential
problem of the form

(2.1)   $LU(x) = F(x)$ in $\Omega$,   $\Lambda U(x) = \Phi(x)$ on the boundary $\partial\Omega$,

is given. On each grid $G^k$ this problem can be approximated by difference
equations of the form

(2.2)   $L^k U^k(x) = F^k(x)$ for $x \in G^k$,   $\Lambda^k U^k(x) = \Phi^k(x)$ for $x \in \partial G^k$.


(See example in Sec. 3.1.) We are interested in solving this discrete
problem on the finest grid, $G^M$. The main idea is to exploit the fact
that the discrete problem on a coarser grid, $G^k$ say, approximates the
same differential problem and hence can be used as a certain approximation
to the $G^M$ problem. A simple use of this fact has long been made by
various authors (e.g., [14]); namely, they first solved (approximately)
the $G^k$ problem, which involves an algebraic system much smaller and thus
much easier to solve than the given $G^M$ problem, and then they interpolated
their solution from $G^k$ to $G^M$, using the result as a first approximation
in some iterative process for solving the $G^M$ problem. A more advanced
technique was to use a still coarser grid in a similar manner when
solving the $G^k$ problem, and so on. The next natural step is to ask
whether we can exploit the proximity between the $G^k$ and $G^M$ problems
not only in generating a good first approximation on $G^M$, but also in the
process of improving the first approximation.

More specifically, let $u^M$ be an approximate solution of the $G^M$ problem,
and let

(2.3)   $L^M u^M = F^M - f^M$,   $\Lambda^M u^M = \Phi^M - \phi^M$.

The discrepancies $f^M$ and $\phi^M$ are called the residual functions, or
residuals. Assuming for simplicity that $L$ and $\Lambda$ are linear (cf. Sec. 5
for the nonlinear case), the exact discrete solution is $U^M = u^M + V^M$,
where the correction $V^M$ satisfies the residual equations

(2.4)   $L^M V^M = f^M$,   $\Lambda^M V^M = \phi^M$.

Can we solve this equation, to a good first approximation, again by
interpolation from solutions on coarser grids? As it is, the answer is
generally negative. Not every $G^M$ problem has a meaningful approximation
on a coarser grid $G^k$. For instance, if the right-hand side $f^M$ fluctuates
rapidly on $G^M$, with wavelength less than $4h_M$, these fluctuations are not
visible on coarser grids. Such rapidly-fluctuating residuals $f^M$ are
exactly what we get when the approximation $u^M$ has itself been obtained as
an interpolation from a coarser-grid solution.
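
The remark about wavelengths shorter than 4h can be checked directly. The following is only an illustrative sketch under assumptions not made in the text: a one-dimensional grid, a coarse grid consisting of every second fine-grid point, and a hypothetical helper named `mode`. A fine-grid mode with phase advance 3π/4 per mesh (wavelength 8h/3, shorter than 4h) takes, at the coarse-grid points, exactly the values of a different mode with phase advance -π/2 per coarse mesh, so the coarse grid cannot represent the original fluctuation.

```python
import math

def mode(theta, n_points):
    """Samples sin(theta * j), j = 0..n_points-1: a grid function whose
    phase advances by theta from one grid point to the next."""
    return [math.sin(theta * j) for j in range(n_points)]

theta = 3.0 * math.pi / 4.0        # wavelength 2*pi*h/theta = (8/3) h < 4h
fine = mode(theta, 33)             # the mode on the fine grid (mesh-size h)
seen_on_coarse = fine[::2]         # its values at the coarse-grid points (mesh-size 2h)

# On the coarse grid the phase advance per mesh is 2*theta = 3*pi/2, which
# cannot be distinguished from 3*pi/2 - 2*pi = -pi/2 (aliasing).
alias = mode(-math.pi / 2.0, len(seen_on_coarse))
max_diff = max(abs(p - q) for p, q in zip(seen_on_coarse, alias))
print("largest difference between the two sampled modes:", max_diff)
```

Any phase advance above π/2 per fine mesh is aliased in the same way, which is why such components must be reduced on the fine grid itself.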

An effective way to damp rapid fluctuations in residuals is by usual
relaxation procedures, e.g., the Gauss-Seidel relaxation (see Sec. 3).
At the first few iterations such procedures usually seem to have fast
convergence, with residuals (or corrections) rapidly decreasing from one
iteration to the next, but soon after the convergence rate levels off
and becomes very slow. Closer examination (see Sec. 3 below) shows that
the convergence is fast as long as the residuals have strong fluctuations
on the scale of the grid. As soon as the residuals are smoothed out,
convergence slows down.
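
This behavior is easy to reproduce on a small model problem. The sketch below is an assumption-laden illustration, not the author's test: it uses the one-dimensional analogue of the Poisson difference equations on 64 intervals, a zero right-hand side (so the approximation itself is the error), and an initial error made of one smooth and one highly oscillatory component. The ratio of successive error norms is small in the first sweep and close to 1 after the oscillatory part has been smoothed out.

```python
import math

def gauss_seidel_sweep(u, f, h):
    """One lexicographic Gauss-Seidel sweep for (u[i-1] - 2u[i] + u[i+1]) / h^2 = f[i],
    with the two boundary values u[0], u[-1] kept fixed."""
    for i in range(1, len(u) - 1):
        u[i] = 0.5 * (u[i - 1] + u[i + 1] - h * h * f[i])

def max_norm(v):
    return max(abs(x) for x in v)

n = 64
h = 1.0 / n
x = [i * h for i in range(n + 1)]
f = [0.0] * (n + 1)                      # exact solution is zero, so u is the error
u = [math.sin(math.pi * t) + math.sin(32.0 * math.pi * t) for t in x]
u[0] = u[-1] = 0.0

ratios = []                              # error reduction achieved by each sweep
for sweep in range(60):
    before = max_norm(u)
    gauss_seidel_sweep(u, f, h)
    ratios.append(max_norm(u) / before)

print("first sweep reduces the error by the factor", ratios[0])
print("60th sweep reduces the error by the factor", ratios[-1])
```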


This is then exactly the point where relaxation sweeps should be
discontinued and approximate solution of the (smoothed out) residual
equations by coarser grids should be employed.


The Multi-Grid (MG) methods are systematic methods of mixing
relaxation sweeps with approximate solution of residual equations on coarser
grids. The residual equations are in turn also solved by combining
relaxation sweeps with corrections through still coarser grids, etc. The
coarsest grid $G^0$ is coarse enough to make the solution of its algebraic
system inexpensive compared with, say, one relaxation sweep over the
finest grid.

The following sections further explain these ideas. Sec. 3.1 explains,
through a simple example, what a relaxation sweep is and shows that it indeed
smooths out the residuals very efficiently. The smoothing rates of general
difference systems are summarized in Sec. 3.2. A full multi-grid algorithm,
composed of relaxation sweeps over the various grids with suitable
interpolations in between, is then presented in Sec. 4. An important modification
for nonlinear problems is described in Sec. 5 (and used later as the basic
algorithm for non-uniform grids and adaptive procedures). Appendix A
supplements these with suitable stopping criteria, details of the interpolation
procedures and special techniques (partial relaxation).


3.  RELAXATION  AND  ITS  SMOOTHING  RATE. 

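3.1 An Example. Suppose, for example, we are interested in solving
the partial differential equation

(3.1)   $LU(x,y) \equiv a\,\dfrac{\partial^2 U(x,y)}{\partial x^2} + c\,\dfrac{\partial^2 U(x,y)}{\partial y^2} = F(x,y)$.

Its usual second-order discretization on the grid $G^k$ is

(3.2)   $L^k U^k_{\alpha,\beta} \equiv a\,\dfrac{U^k_{\alpha+1,\beta} - 2U^k_{\alpha,\beta} + U^k_{\alpha-1,\beta}}{h_k^2} + c\,\dfrac{U^k_{\alpha,\beta+1} - 2U^k_{\alpha,\beta} + U^k_{\alpha,\beta-1}}{h_k^2} = F^k_{\alpha,\beta}$,

where $U^k_{\alpha,\beta} = U^k(\alpha h_k, \beta h_k)$, $F^k_{\alpha,\beta} = F^k(\alpha h_k, \beta h_k)$; $\alpha, \beta$ integers.

(In the multi-grid context it is important to define the difference equations
in this divided form, without, for example, multiplying throughout by $h_k^2$,
in order to get the proper relative scale at the different levels.) Given
an approximation $u$ to $U^k$, a simple example of a relaxation scheme to improve
it is the following.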

Gauss-Seidel Relaxation: The points $(\alpha,\beta)$ of $G^k$ are scanned one by one
in some prescribed order; e.g., lexicographic order. At each point the value
$u_{\alpha,\beta}$ is replaced by a new value, $\bar u_{\alpha,\beta}$, such that equation (3.2) at that
point is satisfied. That is, $\bar u_{\alpha,\beta}$ satisfies

(3.3)   $a\,\dfrac{u_{\alpha+1,\beta} - 2\bar u_{\alpha,\beta} + \bar u_{\alpha-1,\beta}}{h_k^2} + c\,\dfrac{u_{\alpha,\beta+1} - 2\bar u_{\alpha,\beta} + \bar u_{\alpha,\beta-1}}{h_k^2} = F^k_{\alpha,\beta}$,

where the new values $\bar u_{\alpha-1,\beta}$, $\bar u_{\alpha,\beta-1}$ are used since, in the lexicographic
order, by the time $(\alpha,\beta)$ is scanned new values have already replaced old
values at $(\alpha-1,\beta)$ and $(\alpha,\beta-1)$.

A complete pass, scanning in this manner all the points of $G^k$, is
called a (Gauss-Seidel lexicographic) $G^k$ relaxation sweep. The new
approximation $\bar u$ does not satisfy (3.2), and further relaxation sweeps may
be required to improve it. An important quantity therefore is the
rate of convergence, $\mu$ say, which may be defined by

(3.4)   $\mu = \dfrac{\|\bar v\|}{\|v\|}$,   where $v = U^k - u$, $\bar v = U^k - \bar u$,

$\|\cdot\|$ being any suitable discrete norm.
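
The sweep (3.3) and the rate (3.4) can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the program of Appendix B: the grid is the unit square with $n$ intervals per side, grid functions are nested lists indexed `[alpha][beta]`, boundary values are prescribed and kept fixed, the maximum norm is used in (3.4), and the names `gauss_seidel_sweep` and `max_norm_diff` as well as the test solution are hypothetical.

```python
def gauss_seidel_sweep(u, F, a, c, h):
    """One lexicographic Gauss-Seidel sweep for (3.2), changing u in place.
    Each interior value is replaced so that (3.3) holds at that point; the
    values at (alpha-1, beta) and (alpha, beta-1) are already the new ones."""
    n = len(u) - 1
    for al in range(1, n):
        for be in range(1, n):
            u[al][be] = (a * (u[al + 1][be] + u[al - 1][be])
                         + c * (u[al][be + 1] + u[al][be - 1])
                         - h * h * F[al][be]) / (2.0 * a + 2.0 * c)

def max_norm_diff(p, q):
    return max(abs(x - y) for rp, rq in zip(p, q) for x, y in zip(rp, rq))

# A test problem whose exact discrete solution U is known by construction.
n, a, c = 8, 1.0, 2.0
h = 1.0 / n
U = [[(al * h) ** 2 - (al * h) * (be * h) ** 3 for be in range(n + 1)]
     for al in range(n + 1)]
F = [[0.0] * (n + 1) for _ in range(n + 1)]
for al in range(1, n):
    for be in range(1, n):
        F[al][be] = (a * (U[al + 1][be] - 2.0 * U[al][be] + U[al - 1][be])
                     + c * (U[al][be + 1] - 2.0 * U[al][be] + U[al][be - 1])) / (h * h)

# Initial approximation: correct on the boundary, zero in the interior.
u = [[U[al][be] if al in (0, n) or be in (0, n) else 0.0 for be in range(n + 1)]
     for al in range(n + 1)]

err_before = max_norm_diff(U, u)      # ||v||
gauss_seidel_sweep(u, F, a, c, h)
err_after = max_norm_diff(U, u)       # ||v-bar||
mu = err_after / err_before           # rate of convergence (3.4) for this sweep
print("mu for the first sweep:", mu)
```

Applying the same sweep to `U` itself leaves it unchanged up to rounding, since the exact discrete solution already satisfies (3.2) at every point.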

The rate of convergence of the above relaxation scheme is
asymptotically very slow. That is, except for the first few relaxation
sweeps we have $\mu = 1 - O(h_k^2)$. This means that we have to perform $O(h_k^{-2})$
relaxation sweeps to reduce the error by an order of magnitude.

In the multi-grid method, however, the role of relaxation is not to
reduce the error, but to smooth it out; i.e., to reduce the high-frequency
components of the error (the lower frequencies being reduced by
relaxation sweeps on coarser grids). In fact, since smoothing is basically
a local process (high frequencies have short coupling range), we can
analyze it in the interior of $G^k$ by (locally) expanding the error in
Fourier series. This will allow us to study separately the convergence
rate of each Fourier component, and, in particular, the convergence
rate of high-frequency components, which is the rate of smoothing.

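Thus, to study the $\theta = (\theta_1, \theta_2)$ Fourier component of the error functions
$v$ and $\bar v$ before and after the relaxation sweep, we put

(3.5)   $v_{\alpha,\beta} = A_\theta\, e^{i(\theta_1\alpha + \theta_2\beta)}$   and   $\bar v_{\alpha,\beta} = \bar A_\theta\, e^{i(\theta_1\alpha + \theta_2\beta)}$.

Subtracting (3.2) from (3.3), we get the relation

(3.6)   $a\,(v_{\alpha+1,\beta} - 2\bar v_{\alpha,\beta} + \bar v_{\alpha-1,\beta}) + c\,(v_{\alpha,\beta+1} - 2\bar v_{\alpha,\beta} + \bar v_{\alpha,\beta-1}) = 0$,

from which, by (3.5),

$A_\theta\,(a e^{i\theta_1} + c e^{i\theta_2}) + \bar A_\theta\,(a e^{-i\theta_1} + c e^{-i\theta_2} - 2a - 2c) = 0$.

Hence the convergence rate of the $\theta$ component is

(3.7)   $\mu(\theta) = \left|\dfrac{\bar A_\theta}{A_\theta}\right| = \left|\dfrac{a e^{i\theta_1} + c e^{i\theta_2}}{2a + 2c - a e^{-i\theta_1} - c e^{-i\theta_2}}\right|$.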

Define $|\theta| = \max(|\theta_1|, |\theta_2|)$. In domains of diameter $O(1)$ the lowest
Fourier components have $|\theta| = O(h_k)$, and their convergence rate therefore
is $\mu(\theta) = 1 - O(h_k^2)$. Here, however, we are interested in the rate of
smoothing, which is defined by

(3.8)   $\bar\mu = \max_{\hat\rho\pi \le |\theta| \le \pi} \mu(\theta)$,

where $\hat\rho$ is the mesh-size ratio and the range $\hat\rho\pi \le |\theta| \le \pi$ is the suitable
range of high-frequency components, i.e., the range of components that
cannot be approximated on the coarser grid, because its mesh-size is
$h_{k-1} = h_k / \hat\rho$. We will assume here that $\hat\rho = 1/2$, which is the usual ratio
(cf. Sec. 6.2).

Consider first the case $a = c$ (Poisson equation). A simple calculation
shows that $\bar\mu = \mu(\pi/2, \arccos(4/5)) = .5$. This is a very satisfactory
rate; it implies that 3 relaxation sweeps reduce the high-frequency
error components by almost an order of magnitude. Similar
rates are obtained for general $a$ and $c$, provided $a/c$ is of moderate
size.
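
The value .5 can be confirmed numerically by sampling (3.7) over the high-frequency range of (3.8). The sketch below is illustrative only; the function names, the sampling step π/64 and the small tolerance used to include the boundary of the range are assumptions. It also evaluates a case with $a/c = 0.01$ to show how the rate deteriorates when $a/c$ is far from moderate.

```python
import cmath
import math

def mu_gauss_seidel(theta1, theta2, a, c):
    """Convergence rate (3.7) of the (theta1, theta2) error component
    under one lexicographic Gauss-Seidel sweep."""
    num = a * cmath.exp(1j * theta1) + c * cmath.exp(1j * theta2)
    den = (2.0 * a + 2.0 * c
           - a * cmath.exp(-1j * theta1) - c * cmath.exp(-1j * theta2))
    return abs(num / den)

def smoothing_rate(a, c, rho=0.5, m=64):
    """Approximates (3.8): the maximum of mu over all sampled frequencies with
    rho*pi <= max(|theta1|, |theta2|) <= pi, using the sampling step pi/m."""
    best = 0.0
    for i in range(-m, m + 1):
        for j in range(-m, m + 1):
            t1, t2 = i * math.pi / m, j * math.pi / m
            if max(abs(t1), abs(t2)) >= rho * math.pi - 1e-12:
                best = max(best, mu_gauss_seidel(t1, t2, a, c))
    return best

poisson = smoothing_rate(1.0, 1.0)
at_stated_point = mu_gauss_seidel(math.pi / 2.0, math.acos(0.8), 1.0, 1.0)
moderate = smoothing_rate(1.0, 2.0)
degenerate = smoothing_rate(0.01, 1.0)

print("a = c:        sampled smoothing rate", poisson)
print("a = c:        mu(pi/2, arccos(4/5)) ", at_stated_point)
print("a/c = 1/2:    sampled smoothing rate", moderate)
print("a/c = 1/100:  sampled smoothing rate", degenerate)
```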

The rate of smoothing is less remarkable in the degenerate case
$a \ll c$ (or $c \ll a$). For instance,

(3.9)   $\mu\!\left(\dfrac{\pi}{2}, 0\right) = \left[\dfrac{a^2 + c^2}{a^2 + (c + 2a)^2}\right]^{1/2}$,

which approaches 1 as $a \to 0$. Thus, for problems with such a degeneracy,
Gauss-Seidel relaxation is not a suitable smoothing scheme. But other
schemes exist. For example,

Line Relaxation: Instead of treating each point $(\alpha,\beta)$ of $G^k$
separately, one takes simultaneously a line of points at a time, where a line
is the set of all points $(\alpha,\beta)$ in $G^k$ with the same $\alpha$ (a vertical line).
All the values $u_{\alpha,\beta}$ on such a line are simultaneously replaced by new
values $\bar u_{\alpha,\beta}$ which simultaneously satisfy all the equations (3.2) on that
line. (This is easy and inexpensive to do, since the system of
equations to be solved for each such line is a tridiagonal, diagonally
dominant system. See, e.g., [17].) As a result, we get the same relation
as (3.3) above, except that $u_{\alpha,\beta+1}$ is replaced by $\bar u_{\alpha,\beta+1}$. Hence, instead
of (3.7) we will get:

(3.10)   $\mu(\theta) = \left|\dfrac{a}{2(a + c - c\cos\theta_2) - a e^{-i\theta_1}}\right|$,

from which one can derive the smoothing rate

(3.11)   $\bar\mu = \max\left\{5^{-1/2},\ \dfrac{a}{a + 2c}\right\}$,

which is very satisfactory, even in the degenerate case $a \ll c$.
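
The closed form (3.11) can be compared with a direct sampling of (3.10) over the high-frequency range. The following sketch is illustrative, with hypothetical function names and an assumed sampling step of π/64. The last case, $c \ll a$, follows from (3.11) itself: the term $a/(a+2c)$ approaches 1, so relaxation by vertical lines alone does not smooth well when the strong coupling is in the other direction.

```python
import cmath
import math

def mu_line(theta1, theta2, a, c):
    """Convergence rate (3.10) of the (theta1, theta2) error component
    under relaxation by vertical lines (all points with the same alpha)."""
    den = 2.0 * (a + c - c * math.cos(theta2)) - a * cmath.exp(-1j * theta1)
    return abs(a / den)

def sampled_smoothing_rate(a, c, rho=0.5, m=64):
    """Maximum of (3.10) over sampled frequencies with
    rho*pi <= max(|theta1|, |theta2|) <= pi."""
    best = 0.0
    for i in range(-m, m + 1):
        for j in range(-m, m + 1):
            t1, t2 = i * math.pi / m, j * math.pi / m
            if max(abs(t1), abs(t2)) >= rho * math.pi - 1e-12:
                best = max(best, mu_line(t1, t2, a, c))
    return best

def formula_3_11(a, c):
    return max(5.0 ** -0.5, a / (a + 2.0 * c))

cases = [(1.0, 1.0), (0.01, 1.0), (100.0, 1.0)]
results = [(a, c, sampled_smoothing_rate(a, c), formula_3_11(a, c)) for a, c in cases]
for a, c, sampled, closed in results:
    print("a =", a, " c =", c, " sampled:", sampled, " (3.11):", closed)
```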


3.2. General results. The above situation is very general (see [4]
and Chapter 3 of [3]): For any uniformly elliptic system of difference
equations, it can be shown that few relaxation sweeps are enough to
reduce the high-frequency error components by an order of magnitude.
The same holds for degenerate elliptic systems, provided a suitable
relaxation scheme is selected. A scheme of line-relaxation which alternately
uses all line directions and all sweeping directions is suitable for any
degenerate case. Moreover, such a scheme is suitable even for non-elliptic
systems, provided it is used "selectively"; i.e., the entire
domain is swept in all directions, but new values are not introduced
at points where a local test shows the equation to be non-elliptic and
the forward characteristic direction to conflict with the current
sweeping direction.

By employing local mode analysis (analysis of Fourier components)
similar to the example above, one can explicitly calculate the smoothing
rate $\bar\mu$ for any given difference equation with any given relaxation scheme.
(Usually $\bar\mu$ should be calculated numerically; an efficient FORTRAN
subroutine exists; typical values are given in Table 1, in Sec. 6.2.) In
this way, one can select the best relaxation scheme from a given set of
possibilities. The selection of the difference equation itself may also
take this aspect into account. This analysis can also be done for
non-linear problems (or linear problems with non-constant coefficients),
by local linearization and freezing of coefficients. Such localization is
fully justified here, since we are interested only in a local property
(the property of smoothing). By contrast, one cannot make a similar
mode analysis to predict the overall convergence rate $\mu$ of a given
relaxation scheme, since this is not a local property.

An important feature of the smoothing rate $\bar\mu$ is its insensitivity.
In the above example no relaxation parameters were assumed. We could
introduce the usual relaxation parameter $\omega$; i.e., replace at each point
the old value $u_{\alpha,\beta}$ not with the calculated $\bar u_{\alpha,\beta}$, but with
$u_{\alpha,\beta} + \omega(\bar u_{\alpha,\beta} - u_{\alpha,\beta})$. The mode analysis shows, however, that no $\omega \ne 1$
provides a smoothing rate better than $\omega = 1$. In other cases, $\omega = 1$ is not
optimal, but its $\bar\mu$ is not significantly larger than the minimal $\bar\mu$. In
delayed-displacement relaxation schemes a value $\omega < \omega_{\mathrm{critical}} < 1$ should
often be used to obtain $\bar\mu < 1$, but there is no sensitive dependence on the
precise value of $\omega$, and suitable values are easily obtained from the local
mode analysis. Generally, the smoothing rate of delayed-displacement
schemes is somewhat worse than that of immediate-displacement schemes,
and the latter should, therefore, be preferred, except when parallel
processing is used.


3.3. Acceleration by weighting. The rate of smoothing $\bar\mu$ may sometimes
be further improved by various parameters introduced into the
scheme. Since $\bar\mu$ is reliably obtained from the local mode analysis, we
can optimize these parameters to minimize $\bar\mu$. For linear problems,
such optimal parameters can be determined once and for all, since they
do not depend on the shape of the domain. For nonlinear problems precise
optimization is expensive, and one should prefer the simpler, more robust
relaxation schemes, such as SOR.

One general way of parametrization is the weighting of corrections.
We first calculate, in any relaxation scheme, the required correction
$\delta u_\nu = \bar u_\nu - u_\nu$ (where $\nu = (\alpha,\beta)$ or, for a general dimension $d$,
$\nu = (\nu_1, \nu_2, \ldots, \nu_d)$, $\nu_j$ integers). Then, instead of introducing these
corrections, we actually introduce corrections which are some linear
combination of $\delta u_\nu$ at neighboring points. That is, the actual new
values are

(3.12)   $u^{\mathrm{new}}_\nu = u_\nu + \sum_{\gamma \in \Gamma} \omega_\gamma\, \delta u_{\nu+\gamma}$,

where the weights $\omega_\gamma$ are the parameters, $\gamma = (\gamma_1, \gamma_2, \ldots, \gamma_d)$, $\gamma_j$
integers and $\Gamma$ a small set near $(0, 0, \ldots, 0)$. For any fixed $\Gamma$ we can
optimize the weights. In case $\Gamma = \{0\}$, $\omega_0$ is the familiar relaxation
parameter. Weighting larger $\Gamma$ is useful in delayed-displacement
relaxation schemes. For immediate-displacement line relaxation, weighting
along the line may be useful.

Examples. In case of simultaneous displacement (Jacobi) relaxation
for the 5-point Poisson equation, the optimal weight for $\Gamma = \{0\}$ is
$\omega_{00} = .8$, for which the smoothing rate is $\bar\mu = .60$. For the set
$\Gamma = \{(0,0), (0,\pm 1), (\pm 1,0)\}$ the optimal weights are
$\omega_{00} = 6\omega_{0,\pm 1} = 6\omega_{\pm 1,0} = 48/41$, yielding $\bar\mu = 9/41$. This rate seems very
attractive; the smoothing obtained in one sweep equals that obtained by
$(\log \frac{9}{41}) / (\log \frac{1}{2}) = 2.2$ sweeps of Gauss-Seidel relaxation.
Actually, however, each sweep of this weighted-Jacobi relaxation requires
9 additions and 3 multiplications per grid point, whereas each Gauss-Seidel
sweep requires only 4 additions and 1 multiplication per point, so that the
two methods have almost the same convergence rate per operation,
Gauss-Seidel being slightly faster. The weighted Jacobi scheme is considerably
more efficient than any other simultaneous-displacement scheme, but like
any carefully weighted scheme, it is considerably more sensitive to
various changes.
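
The two smoothing rates quoted for Jacobi relaxation can be reproduced by local mode analysis. The sketch below is an illustration under assumptions: it takes the error amplification of weighted Jacobi for the 5-point Poisson operator to be $1 + (s - 1)\,(\omega_{00} + 2\omega_1(\cos\theta_1 + \cos\theta_2))$ with $s = (\cos\theta_1 + \cos\theta_2)/2$, where $\omega_1$ stands for the common weight of the four neighbors; the function names and the sampling step are hypothetical.

```python
import math

def jacobi_amplification(t1, t2, w00, w1):
    """Error amplification of the (t1, t2) component for Jacobi relaxation of
    the 5-point Poisson operator with weighted corrections (3.12):
    weight w00 at the point itself, w1 at each of its four neighbors."""
    s = 0.5 * (math.cos(t1) + math.cos(t2))          # symbol of plain Jacobi
    weights_symbol = w00 + 2.0 * w1 * (math.cos(t1) + math.cos(t2))
    return abs(1.0 + (s - 1.0) * weights_symbol)

def smoothing_rate(w00, w1, rho=0.5, m=64):
    """Maximum amplification over sampled high frequencies,
    rho*pi <= max(|t1|, |t2|) <= pi."""
    best = 0.0
    for i in range(-m, m + 1):
        for j in range(-m, m + 1):
            t1, t2 = i * math.pi / m, j * math.pi / m
            if max(abs(t1), abs(t2)) >= rho * math.pi - 1e-12:
                best = max(best, jacobi_amplification(t1, t2, w00, w1))
    return best

damped = smoothing_rate(0.8, 0.0)                    # Gamma = {0}
weighted = smoothing_rate(48.0 / 41.0, 8.0 / 41.0)   # five-point Gamma
equivalent_gs_sweeps = math.log(weighted) / math.log(0.5)

print("Gamma = {0}, weight .8:     ", damped)
print("five-point Gamma:           ", weighted, "(9/41 =", 9.0 / 41.0, ")")
print("equivalent Gauss-Seidel sweeps:", equivalent_gs_sweeps)
```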


The acceleration by weighting can be more significant for higher
order equations. For the 13-point biharmonic operator, Gauss-Seidel
relaxation requires 12 additions and 3 multiplications per grid point
and gives $\bar\mu = .802$, while weighted Jacobi (with weights $\omega_{00} = 1.552$,
$\omega_{0,\pm 1} = \omega_{\pm 1,0} = .353$) requires 17 additions and 5 multiplications per
point and gives $\bar\mu = .549$, which is 2.7 times faster. (The best relaxation
sweep for the biharmonic equation $\Delta^2 U = F$ is to write it as the
system $\Delta V = F$, $\Delta U = V$ and sweep Gauss-Seidel, alternately on $U$ and $V$.
Such a double sweep costs 8 additions and 2 multiplications per grid
point, and yields $\bar\mu = .5$. But a similar procedure is not possible for
general 4-th order equations.)

4. A MULTI-GRID ALGORITHM (CYCLE C) FOR LINEAR PROBLEMS


There are several actual algorithms for carrying out the basic multi-grid
idea, each with several possible variations. We present here an algorithm
(called "Cycle C" in [3]) which is easy to program, generally applicable
and never significantly less efficient than the others ("Cycle A"
and "Cycle B"). The operation of the algorithm for linear problems is
easier to learn, and is therefore described first. In the next section
the FAS (Full-Approximation-Storage) mode of operation, suitable for
nonlinear problems and other important generalizations, will be described.
A flow-chart of the algorithm is given in Figure 1. (For completeness, we
also flowchart, in Fig. 2, Cycles A and B.) A sample FORTRAN program of
this cycle, together with a computer output, is given in Appendix B.

Cycle C starts with some approximation $u_0^M$ being given on the finest
grid $G^M$. In the linear case one can start with any approximation, but
a major part of the computations is saved if $u_0^M$ has smooth residuals
(e.g., if $u_0^M$ satisfies the boundary conditions and $L^M u_0^M - F^M$ is smooth.
As explained in Sec. 6, smoothing the residuals involves most of the
computational effort). In the nonlinear case, one may have to use a
continuation procedure, usually performed on coarser grids
(cf. Sec. 8.2). Even for linear problems, the most efficient
algorithm is to obtain $u_0^M$ by interpolating from an approximate
solution $u^{M-1}$ calculated on $G^{M-1}$ by a similar algorithm.
(Hence the denomination "cycle" for our present
algorithm, which would generally serve as the basic step in
processes of continuation, refinement and grid adaptation,
or as a time step in evolution problems.) For highest efficiency,
the interpolation from $u^{M-1}$ to $u_0^M$ should be of
sufficiently high order, to exploit all smoothness in $u^{M-1}$.
(Cf. (A.7) in Sec. A.2, and see also Sec. 6.3.)

The basic rule in Cycle C is that each $v^k$ (the function defined on
the grid $G^k$; $k = 0, 1, \ldots, M-1$) is designed to serve as a correction for
the approximation $v^{k+1}$ previously obtained on the next finer grid $G^{k+1}$,
if and when that approximation actually requires a coarse-grid
correction, i.e., if and when relaxation over $G^{k+1}$ exhibits slow rate of
convergence. Thus, the equations to be (approximately) satisfied by
$v^k$ are

(4.1)   $L^k V^k = f^k$,   $\Lambda^k V^k = \phi^k$,

where $f^k$ and $\phi^k$ are the residuals (to the interior equation and the
boundary condition, respectively) left by $v^{k+1}$, that is,

(4.2)   $f^k = I_{k+1}^k (f^{k+1} - L^{k+1} v^{k+1})$,   $\phi^k = I_{k+1}^k (\phi^{k+1} - \Lambda^{k+1} v^{k+1})$.

We denote by $V^k$ the functions in the equations, to distinguish from
their computed approximations $v^k$. When $v^k$ is changing in the algorithm
(causing $V^{k-1}$ to change), $V^k$ remains fixed.

Figure 1. Cycle C, Linear Problems.

Figure 2. Cycles A and B, Linear Problems.

We use the notation $I_k^m$ to represent interpolation from $G^k$ to $G^m$. In
case $m < k$, $I_k^m$ may represent a simple transfer of values to the coarser
grid $G^m$ from the corresponding points in the finer grid $G^k$; or instead,
it may represent transfer of some weighted averages. In case $m > k$, as in
step (e) below, $I_k^m$ is usually a polynomial interpolation of a suitable
order (at least the order of the differential equation. See Secs. A.2
and A.4 for more details).


The equations on $G^k$ are thus defined in terms of the approximate
solution on $G^{k+1}$. On the finest grid $G^M$, the equations are the original
ones; namely

(4.3)   $f^M = F^M$,   $\phi^M = \Phi^M$,   $V^M = U^M$.

The steps of the algorithm are the following:

(a) Set $k \leftarrow M$ ($k$ is the working level, and we start at the finest
level), and introduce the given approximation $v^M \leftarrow u_0^M$.

(b) Improve $v^k$ by one relaxation sweep for the difference equations
(4.1). Symbolically, we write such a sweep as

(4.4)   $v^k \leftarrow \mathrm{Relax}\,[L^k\,\cdot = f^k,\ \Lambda^k\,\cdot = \phi^k]\ v^k$.

(c) If relaxation has sufficiently converged (the precise criterion
is described in Secs. A.7 and A.8), go to Step (e).
If not, and if the convergence rate is still fast (by a criterion
given in Sec. A.6), go back to Step (b). If convergence
is not obtained and the rate is slow, go to Step (d).

(d) If $k = 0$ (the slow convergence has taken place at the coarsest
grid $G^0$), go back to Step (b) (to continue relaxation nevertheless,
since on $G^0$ relaxation is very inexpensive. If, however,
the problem is indefinite, then slow rate of divergence
may occur, in which case the $G^0$ problem should be solved directly.
This is as inexpensive as relaxation, but requires additional
programming. See Sec. 4.1 below). If $k > 0$, lower $k$ by 1 (to
compute correction on the next, coarser level). Compute $f^k$ and
$\phi^k$ on this new level, using (4.2), put $v^k = 0$ as the starting
approximation, and go to Step (b).

(e) If $k = M$ (convergence has been obtained on the finest level),
the algorithm is terminated. If $k < M$ ($v^k$ has converged and is
ready to serve as a correction to $v^{k+1}$), put

(4.5)   $v^{k+1} \leftarrow v^{k+1} + I_k^{k+1} v^k$.

Then advance $k$ by 1 (to resume computations in the finer level)
and go to Step (b).
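
Steps (a) to (e) can be sketched as follows. This is a minimal illustration under loudly stated assumptions, not the FORTRAN program of Appendix B: the problem is the one-dimensional analogue of the Poisson equation on (0, 1) with zero boundary values, level $k$ has $2^{k+1}$ intervals, residuals are transferred by weighted averages and corrections by linear interpolation, and the switching parameters `eta` and `delta` (slow-convergence factor and coarse-level tolerance factor) are hypothetical stand-ins for the criteria of Secs. A.6 to A.8, which are not reproduced here. All function names are illustrative.

```python
import math

def residual(v, f, h):
    """f - L v at the interior points (zero at the two boundary points)."""
    r = [0.0] * len(v)
    for i in range(1, len(v) - 1):
        r[i] = f[i] - (v[i - 1] - 2.0 * v[i] + v[i + 1]) / (h * h)
    return r

def relax(v, f, h):
    """One lexicographic Gauss-Seidel sweep for (v[i-1] - 2v[i] + v[i+1]) / h^2 = f[i]."""
    for i in range(1, len(v) - 1):
        v[i] = 0.5 * (v[i - 1] + v[i + 1] - h * h * f[i])

def to_coarse(r):
    """Transfer to the next coarser grid by weighted averages."""
    rc = [0.0] * ((len(r) - 1) // 2 + 1)
    for j in range(1, len(rc) - 1):
        rc[j] = 0.25 * r[2 * j - 1] + 0.5 * r[2 * j] + 0.25 * r[2 * j + 1]
    return rc

def to_fine(vc):
    """Linear interpolation to the next finer grid."""
    vf = [0.0] * (2 * (len(vc) - 1) + 1)
    for j in range(len(vc)):
        vf[2 * j] = vc[j]
    for j in range(len(vc) - 1):
        vf[2 * j + 1] = 0.5 * (vc[j] + vc[j + 1])
    return vf

def cycle_c(u0, F, M, tol, eta=0.6, delta=0.3, max_sweeps=100000):
    """Returns (solution on level M, work in units of finest-grid sweeps)."""
    h = [1.0 / 2 ** (k + 1) for k in range(M + 1)]
    v, f, eps = [None] * (M + 1), [None] * (M + 1), [0.0] * (M + 1)
    k, v[M], f[M], eps[M] = M, list(u0), list(F), tol         # step (a)
    prev, work = None, 0.0
    for _ in range(max_sweeps):
        relax(v[k], f[k], h[k])                               # step (b)
        work += 2.0 ** (k - M)
        r = residual(v[k], f[k], h[k])
        res = max(abs(x) for x in r)
        if res >= eps[k]:                                     # step (c): not converged
            if prev is None or res < eta * prev:              # still fast: relax again
                prev = res
            elif k > 0:                                       # step (d): go coarser
                f[k - 1] = to_coarse(r)                       # residual transfer (4.2)
                eps[k - 1] = delta * res
                k -= 1
                v[k] = [0.0] * len(f[k])
                prev = None
            continue                                          # k == 0: keep relaxing
        if k == M:                                            # step (e)
            return v[M], work
        v[k + 1] = [p + q for p, q in zip(v[k + 1], to_fine(v[k]))]   # (4.5)
        k += 1
        prev = None
    raise RuntimeError("no convergence within max_sweeps")

# Usage: recover a known discrete solution on 64 intervals, starting from zero.
M = 5
n = 2 ** (M + 1)
U = [math.sin(math.pi * i / n) + 0.5 * math.sin(20.0 * math.pi * i / n) for i in range(n + 1)]
U[0] = U[-1] = 0.0
F = [0.0] * (n + 1)
for i in range(1, n):
    F[i] = (U[i - 1] - 2.0 * U[i] + U[i + 1]) * n * n
solution, work = cycle_c([0.0] * (n + 1), F, M, tol=1e-6)
error = max(abs(p - q) for p, q in zip(solution, U))
print("max error:", error, " work units:", work)
```

The work counter charges each sweep on level $k$ as $2^{k-M}$ of a finest-grid sweep, which is the natural unit in one dimension; interpolation and residual transfers are not counted in this sketch.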


The storage required for this algorithm is only a fraction more than
the number of locations, 2n say, required to store $u^M$ and $F^M$ on the finest
grid. Indeed, for a d-dimensional problem, a storage of roughly $2n/2^d$
locations is required to store $v^{M-1}$ and $f^{M-1}$, the next level requires
$2n/2^{2d}$, etc. The total for all levels is

(4.6)   $2n\,(1 + 2^{-d} + 2^{-2d} + \cdots) < 2n\,\dfrac{2^d}{2^d - 1}$.

(In the FAS version below, a major reduction of storage area is possible
through segmental refinement. See Sec. 7.5.)
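
The bound (4.6) is a geometric series and can be tabulated directly. The short sketch below is illustrative only; the function names and the sample grid of 129 by 129 points with 7 levels are assumptions. For $d = 2$ the total stays below $\frac{8}{3}n$, i.e. one third more than the $2n$ locations needed on the finest grid alone.

```python
def level_storage(n, d, levels):
    """Locations needed on each level, finest first: 2n, 2n/2^d, 2n/2^(2d), ..."""
    return [2.0 * n / (2 ** d) ** j for j in range(levels)]

def storage_bound(n, d):
    """Right-hand side of (4.6)."""
    return 2.0 * n * 2 ** d / (2 ** d - 1)

n, d = 129 * 129, 2
per_level = level_storage(n, d, 7)
total = sum(per_level)
bound = storage_bound(n, d)
print("finest level:", per_level[0], " all levels:", total, " bound (4.6):", bound)
```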


4.1. Indefinite Problems and the Size of the Coarsest Grid. If, on
any grid $G^k$, the boundary-value problem (4.1) is a non-definite elliptic
problem, with eigenvalues

(4.7)   $\lambda_1^k \le \lambda_2^k \le \cdots \le \lambda_\ell^k < 0 < \lambda_{\ell+1}^k \le \lambda_{\ell+2}^k \le \cdots$

and with the corresponding eigenfunctions $V_1^k, V_2^k, \ldots, V_\ell^k, V_{\ell+1}^k, \ldots$,
then it cannot be solved by straight relaxation. Any relaxation sweep
will reduce the error components in the space spanned by $V_{\ell+1}^k, V_{\ell+2}^k, \ldots$,
but will magnify all components in the span of $V_1^k, \ldots, V_\ell^k$. A multi-grid
solution, however, is not seriously affected by this magnification,
provided the magnified components are suitably reduced by the
coarse-grid corrections. This will usually be the case, since these components
are basically of low frequency and are well approximated on coarser grids.
But care should be taken regarding the coarsest grid:

On  the  coarsest  grid,  an  indefinite  problem  should  be  solved 
directly,  (i.e.,  not  by  relaxation  of  any  kind.  Semi-iterative  solutions, 
like  Newton  iterations  for  non-linear  problems,  are,  of  course,  per¬ 
missible)  .  Furthermore,  this  grid  should  be  fine  enough  to  provide  rough 

approximation  to  V^,  v^,  ••«,  for  any  k,  hence  also  for  the  corresponding 

differential  eigenfunctions.  This  means  that  G°  should  contain  at 
least  0(1),  probably  2£,  points.  Or,  in  other  words,  G®  should  be  just 
fine  enough  to  still  have  smoothing  capability  at  any  finer  level  G^. 

For  example,  if  SOR  relaxations  with  ai<a)  are  used,  h  should  satisfy 
(see  [4]  or  Sec.  3  in  [3])  ^  ^ 


(4.8)  Re  {B(0,h)  /  b  (h)}  >  o,  (o  <  h  <  h  )  , 

o  —  —  o 

where  B(0,h)  is  the  symbol  of  (see  (A. 3)  in  Appendix  A)  and  b  (h)  is  its 
central  coefficient.  ^ 

Usually, $G^0$ can still be coarse enough to have the direct solution
of its equations still far less expensive than, say, one relaxation
sweep over the finest grid, so that the indefinite problem is solved with
the same overall efficiency as definite problems.
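
The magnification of low-frequency components by relaxation can be observed on a small model. The Python sketch below is an illustration under assumptions not taken from the text: the one-dimensional operator -u'' - c u with zero boundary values, Gauss-Seidel relaxation, and an initial error equal to the lowest eigenfunction. With c = 20 the lowest eigenvalue (about 9.8 - 20) is negative and the error grows; with c = 0 the problem is definite and the same error decays. Function names are hypothetical.

```python
import math


def gauss_seidel_error_sweep(e, h, c):
    """One Gauss-Seidel sweep on the error of -u'' - c*u = f (zero boundaries).

    The error satisfies the homogeneous equation, so the right-hand side is 0.
    """
    diag = 2.0 / (h * h) - c
    for i in range(1, len(e) - 1):
        e[i] = (e[i - 1] + e[i + 1]) / (h * h) / diag


def norm(e):
    return math.sqrt(sum(x * x for x in e))


def low_mode_growth(n=16, c=20.0, sweeps=20):
    """Ratio of error norms after/before relaxing the lowest eigenfunction."""
    h = 1.0 / n
    e = [math.sin(math.pi * i * h) for i in range(n + 1)]
    e[n] = 0.0                      # enforce the boundary value exactly
    start = norm(e)
    for _ in range(sweeps):
        gauss_seidel_error_sweep(e, h, c)
    return norm(e) / start
```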


5.  THE  FAS  (FULL  APPROXIMATION-STORAGE)  ALGORITHM. 

In the FAS mode of the multi-grid algorithms, instead of storing a
correction $v^k$ (designed to correct the finer-level approximation
$u^{k+1}$), the idea is to store the full current approximation $u^k$,
which is the sum of the correction $v^k$ and its base approximation
$I_{k+1}^k u^{k+1}$:

(5.1)    $u^k = I_{k+1}^k u^{k+1} + v^k$,    $(k = 0, 1, \ldots, M-1)$.

In terms of these full-approximation functions, we can rewrite the
correction equations (4.1-3) as

(5.2)    $L^k U^k = \bar F^k$,    $\Lambda^k U^k = \bar\Phi^k$,

where

(5.3)    $\bar F^k = L^k(I_{k+1}^k u^{k+1}) + I_{k+1}^k(\bar F^{k+1} - L^{k+1} u^{k+1})$,
         $\bar\Phi^k = \Lambda^k(I_{k+1}^k u^{k+1}) + I_{k+1}^k(\bar\Phi^{k+1} - \Lambda^{k+1} u^{k+1})$,
         $(k = 0, 1, \ldots, M-1)$,

and where for $k=M$ we have the original problem, i.e.,

(5.4)    $\bar F^M = F^M$,    $\bar\Phi^M = \Phi^M$.

For linear problems, equations (5.2-4) are exactly equivalent to
(4.1-3). The advantage of the FAS mode is that equations (5.2-4) apply
equally well to nonlinear problems. To see this, consider for instance
the nonlinear equation $L^M U^M = F^M$ given on the finest grid. Given an
approximate solution $u^M$ we can still improve it by relaxation sweeps,
with smoothing rates $\bar\mu$ (varying over the domain, but still reliably
estimated by mode analyses, applied locally to the linearized, frozen
equation). As in the linear case, the smoothed-out functions are the
residual

         $f^M = F^M - L^M u^M$

and the correction $U^M - u^M$. Therefore, the equation that can be
approximated on coarser grids is the residual equation

         $L^M U^M - L^M u^M = f^M$.

Its coarser-grid approximation is

(5.5)    $L^{M-1} U^{M-1} - L^{M-1}(I_M^{M-1} u^M) = I_M^{M-1} f^M$,

which is the same as (5.2) for $k=M-1$. In interpolating $U^{M-1}$ (or a
computed approximation $u^{M-1}$) back to $G^M$, we should actually
interpolate $U^{M-1} - I_M^{M-1} u^M$, because this is the coarse-grid
approximation to the smoothed-out function $U^M - u^M$. Similarly, in
interpolating an (approximate) solution $U^k$ of (5.2) to the finer grid
$G^{k+1}$, the polynomial interpolation should operate on the correction.
Thus the interpolation is

(5.6)    $u^{k+1} \leftarrow u^{k+1} + I_k^{k+1}(u^k - I_{k+1}^k u^{k+1})$,

which is equivalent to (4.5). Note that generally
$I_k^{k+1} I_{k+1}^k u^{k+1} \ne u^{k+1}$.

(Again we distinguish between the notation $U^k$ used to write the
equations and the computed approximation $u^k$. Equation (5.2), $k<M$,
is not equivalent to (2.2), although they both use the notation $U^k$.)

The  FAS  (Cycle  C)  algorithm  is  the  same  algorithm  as  in  Sec.  4,  with 
the  FAS  equations  (5.2-4)  replacing  (4.1-3),  and  with  (5.6)  replacing  (4.5). 
It  is  flowcharted  in  Fig.  3. 
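
The equivalence of the two modes for linear problems can be checked directly on a two-grid model. The Python sketch below is an illustration under assumptions not taken from the text: the one-dimensional operator -u'' with zero boundary values, injection as the fine-to-coarse transfer, and an exact (tridiagonal) solve on the coarse grid. It forms the Correction-Storage right-hand side as in (4.2) and the FAS right-hand side as in (5.3), and returns both coarse-grid corrections. All function names are hypothetical.

```python
def apply_L(u, h):
    """(2u_i - u_{i-1} - u_{i+1})/h^2 at interior points, 0 at the boundary."""
    out = [0.0] * len(u)
    for i in range(1, len(u) - 1):
        out[i] = (2.0 * u[i] - u[i - 1] - u[i + 1]) / (h * h)
    return out


def relax(u, F, h):
    """One Gauss-Seidel sweep for L u = F."""
    for i in range(1, len(u) - 1):
        u[i] = 0.5 * (u[i - 1] + u[i + 1] + h * h * F[i])


def solve(rhs, H):
    """Solve L U = rhs exactly (Thomas algorithm), U = 0 at both ends."""
    m = len(rhs) - 2                       # number of interior unknowns
    diag, off = 2.0 / (H * H), -1.0 / (H * H)
    cp, dp = [0.0] * m, [0.0] * m
    cp[0], dp[0] = off / diag, rhs[1] / diag
    for i in range(1, m):
        den = diag - off * cp[i - 1]
        cp[i] = off / den
        dp[i] = (rhs[i + 1] - off * dp[i - 1]) / den
    U = [0.0] * len(rhs)
    U[m] = dp[m - 1]
    for i in range(m - 2, -1, -1):
        U[i + 1] = dp[i] - cp[i] * U[i + 2]
    return U


def coarse_corrections(u, F, h):
    """Return the coarse-grid correction computed in CS mode and in FAS mode."""
    res = [Fi - Li for Fi, Li in zip(F, apply_L(u, h))]
    Iu, Ires = u[::2], res[::2]            # injection to the coarse grid
    H = 2.0 * h
    v_cs = solve(Ires, H)                  # CS: L v = I(F - L u)
    F_bar = [a + b for a, b in zip(apply_L(Iu, H), Ires)]   # as in (5.3)
    U = solve(F_bar, H)                    # FAS: L U = F_bar, as in (5.2)
    v_fas = [Ui - Iui for Ui, Iui in zip(U, Iu)]   # correction used in (5.6)
    return v_cs, v_fas
```

For a nonlinear operator only `apply_L` (and the coarse solver) would change in the FAS branch, whereas the CS branch would no longer be meaningful; that is the point made above.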

The FAS mode has several important advantages: It is suitable for
general nonlinear problems, with the same procedures (relaxation and
interpolation routines) used at all levels. Thus, for example, only one
relaxation routine should be written. Moreover, this mode is suitable for
composite grids (non-uniform grids created by increasingly finer levels
being defined on increasingly smaller subdomains; see Sec. 7.2), which is
the basis for grid adaptation on one hand, and segmental refinement
(see Sec. 7.5) on the other hand. Generally speaking, the basic feature of
the FAS mode is that the function stored on a coarse grid $G^k$ coincides
there with the fine-grid solution: $u^k = I_M^k u^M$. This enables us to
manipulate accurate solutions on coarse grids.


The  storage  required  for  the  FAS  algorithm  is  again  given  by  (4.6). 
With  segmental  refinement  (Sec.  7.5)  it  can  be  reduced  far  below  that, 
even  to  0(log  n) . 


An important by-product of the FAS mode is a good estimate for the
truncation error, which is useful in defining natural stopping criteria
(see Sec. A.8) and grid-adaptation criteria (Sec. 8.3). Indeed, for
any $k < m \le M$ it can easily be shown (by induction on m, using
(5.2-3)) that

(5.7)    $\bar F^k - I_m^k \bar F^m = L^k(I_m^k u^m) - I_m^k L^m u^m$,
         $\bar\Phi^k - I_m^k \bar\Phi^m = \Lambda^k(I_m^k u^m) - I_m^k \Lambda^m u^m$,

which are exactly the $G^m$ approximations to the $G^k$ truncation errors.

A slight disadvantage of the FAS mode is the longer calculation required
in computing $\bar F^k$, which is almost twice as long as the calculation
of $f^k$ in the former (Correction-Storage) mode. This extra calculation is
equivalent to one extra relaxation sweep on $G^k$, but only for $k<M$, and
is about 5% to 10% of the total amount of calculations. Hence, for linear
problems on uniform grids, the CS mode is slightly preferable.

Figure 3. Cycle C, Full-Approximation-Storage.


6. PERFORMANCE ESTIMATES AND NUMERICAL TESTS.

6.1. Predictability. An important feature of the multi-grid method
is that, although iterative, its total computational work can be
predicted in advance by local mode (Fourier) analysis. Such an analysis,
which linearizes and freezes the equations and ignores distant boundaries,
gives a very good approximation to the behavior of high-frequency
components (since they have short coupling range), but usually fails to
approximate the behavior of the lowest frequencies (which interact at long
distances). The main point here, however, is that these lowest frequencies
may indeed be ignored in the multi-grid work estimates, since their
convergence is obtained on coarser grids, where the computational work is
negligible. The purpose of the work on the finer grids is only to converge
the high frequencies. Thus, the mode-analysis predictions, although not
rigorous, are likely to be very realistic. In fact, these predictions
are in full agreement with the results of our computational tests. (For
rigorous bounds, see App. C.)

6.2. Multi-Grid Rates of Convergence. To get a convenient measure
of convergence per unit work, we define as our Work Unit (WU) the
computational work in one relaxation sweep over the finest grid $G^M$. The
number of computer operations in such a unit is roughly $wn$, where $n$
is the number of points in $G^M$ and $w$ is the number of operations
required to compute the residual at each point. (In parallel processing
the count should, of course, be different. Also, the work unit should be
further specified when comparing different discretization and relaxation
schemes.) If the mesh-size ratio is $\hat\rho = h_{k+1}/h_k$ and the
problem's domain is d-dimensional, then a relaxation sweep over $G^{M-j}$
costs approximately $\hat\rho^{jd}$ WUs (assuming the grids are
co-extensive, unlike those in Sec. 7).

Relaxation sweeps make up most of the multi-grid computational work.
The only other process that consumes any significant amount of
computations is the $I_{k+1}^k$ and $I_k^{k+1}$ interpolations. It is
difficult to measure them precisely in WUs, but their total work is always
considerably smaller than the total relaxation work. In the example in
Appendix B, the interpolation work is about 20% of the relaxation work.
Usually the percentage is even lower, since relaxing Poisson problems is
particularly inexpensive.

To unify our estimates and measurements we will therefore define the
multi-grid convergence rate $\mathring\mu$ as the factor by which the errors
are reduced per one WU of relaxation, ignoring any other computational
work (which is never more than 30% of the total work).

The multi-grid rate of convergence may be estimated by a full local
mode analysis. The following is a simplified analysis, which gives a
good approximation. We assume that the relaxation sweep over any grid
$G^k$ affects error components $e^{i\theta\cdot x}$ only in the range
$\pi/h_{k-1} \le |\theta| \le \pi/h_k$, where

(6.1)    $\theta = (\theta_1, \ldots, \theta_d)$,  $\theta\cdot x = \sum_{j=1}^d \theta_j x_j$,  $|\theta| = \max_j |\theta_j|$.

(The $\theta/h_k$ of Sec. 3 and Appendix A is denoted here $\theta$, to unify
the discussion of all levels.) In fact, if proper interpolation scheme is
used (see Sec. A.2) only components in the range
$|\theta| \le (1+\epsilon)\pi/h_{k-1}$ are affected by interactions with
coarser grids. But if proper residual weighting is also used (to make
$\sigma \le 1$; cf. Sec. A.4) then the combined action of the coarse-grid
correction cycles and the $G^k$ relaxation sweeps yields convergence rates
which are slowest at $|\theta| = \pi/h_{k-1}$ (cf. App. A). For such
$\theta$ the coarse-grid cycles have neutral effect, since $\sigma = 1$,
hence the convergence rate is indeed as affected only by the $G^k$
relaxation sweeps.

One relaxation sweep over $G^k$ reduces the error components in the
range $\pi/h_{k-1} \le |\theta| \le \pi/h_k$ by the smoothing factor
$\bar\mu$. (See Sec. 3. If the smoothing rate near a boundary is slower
than $\bar\mu$, which is not the usual case, smoothing may be accelerated
there by partial relaxation sweeps; cf. Sec. A.9.) Thus a multi-grid cycle
with s relaxation sweeps on each level reduces all error components by the
factor $\bar\mu^s$. The amount of work units expended in these sweeps is

         $s + s\hat\rho^d + s\hat\rho^{2d} + \cdots + s\hat\rho^{(M-1)d} < \frac{s}{1 - \hat\rho^d}$.

Hence, the multi-grid convergence rate is

(6.2)    $\mathring\mu = \bar\mu^{(1 - \hat\rho^d)}$,

which is not much bigger than $\bar\mu$. In case $\sigma > 1$, the effective
smoothing rate (see (A.8)) should replace $\bar\mu$ in this estimate.

Estimate (6.2) is not rigorous, but is simple to compute and very
realistic. In fact, numerical experiments (Sec. 6.4-5) usually show
slightly faster (smaller) rates $\mathring\mu$, presumably because the worst
combination of Fourier components is not always present.
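
Estimate (6.2) and the work count that precedes it are simple enough to evaluate directly. The Python sketch below (function names are hypothetical) computes the cost of a sweep on a coarser level, the work of one cycle, the multi-grid rate, and the number of Work Units needed to reduce the error by the factor e. For the smoothing rate .500 with mesh-size ratio 1:2 in two dimensions it gives a rate of about .595 and about 1.92 Work Units, the values quoted for that case in Table 1.

```python
import math


def sweep_cost(rho, d, j):
    """Cost, in Work Units, of one sweep on the grid j levels below the finest."""
    return rho ** (j * d)


def cycle_work(s, rho, d, levels):
    """Work Units spent by s sweeps on each of `levels` grids."""
    return sum(s * sweep_cost(rho, d, j) for j in range(levels))


def mg_rate(mu_bar, rho, d):
    """Multi-grid convergence rate per Work Unit, estimate (6.2)."""
    return mu_bar ** (1.0 - rho ** d)


def work_units_per_efold(mu_bar, rho, d):
    """Work Units needed to reduce the error by the factor e."""
    return 1.0 / abs(math.log(mg_rate(mu_bar, rho, d)))
```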

The  theoretical  multi-grid  convergence  rates,  for  various  represent¬ 
ative  cases,  are  summarized  in  Table  1. 

Explanations to Table 1. The first column specifies the difference
operator and the dimension d. $\Delta_h$ denotes the central second-order
((2d+1)-point) approximation, and $\Delta_h^{(4)}$ the fourth-order
((4d+1)-point "star") approximation, to the Laplace operator. $\Delta_h^2$
is the central 13-point approximation to the biharmonic operator. The
operators $\partial_x$, $\partial_y$, $\partial_{xx}$ and $\partial_{yy}$ are
the usual central second-order approximations to the corresponding
partial-differential operators. $\partial_x^-$ is the backward
approximation. Upstream differencing is assumed for the inertial terms of
the Navier-Stokes equations; central differencing for the viscosity terms,
forward differencing for the pressure terms, and backward differencing for
the continuity equation. $R_h$ is the Reynolds number times the mesh-size.

The second column specifies the relaxation scheme and the relaxation
parameter $\omega$. SOR is Successive Over Relaxation, which for $\omega=1$
is the Gauss-Seidel relaxation. xLSOR (yLSOR) is Line SOR, with lines in the
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TABLE  1.  Theoretical  smoothing  and  MG-convergence  rates 


d 

Relax .  Scheme  u 

ID 

8 

mm 

add  xnuH 

M 

1 

1 

SOR  1 

.557 

.693 

2.73 

2 

1 

IBl 

EE 

,477 

.668 

2.49 

3 

2 

6.9 

n 

.378 

.723 

3.08 

3 

2 

7.5 

2 

SOR  1 

.  1:3 

.667 

.697 

2.77 

4 

1 

6.8 

.8 

1  1:2 

.552 

.640 

2.24 

5 

2 

4.1 

1 

.500 

.595 

1.92 

4 

1 

3.5 

1*2 

.552 

.640 

2.24 

5 

2 

4.1 

1 

2:3 

.400 

.601 

1.96 

4 

1 

2,9 

LSOR  1 

EE 

1,66 

8 

4 

3.1 

ADLR  1 

1.40 

8 

4 

2.6 

.8 

■ 

1.70 

8 

4 

3.1 

SD  .8 

Bw 

.682 

5 

2 

4.8 

WSD  1,17,  ,195 

■ 

.321 

9 

3 

1.6 

1,40,  ,203 

■ 

,600 

1.96 

9 

3 

3.6 

3 

SOR  1 

1:3 

.738 

.746 

3.42 

mm 

7.8 

1:2 

.567 

.608 

2.01 

o 

3.7 

2:3 

,441 

.562 

1.73 

a 

2.0 

2 

SOR  .8 

B 

.665 

2.46 

9.1 

1 

.625 

2.13 

8 

2 

7.9 

1.2 

.666 

2.46 

9 

3 

9.1 

LSOR  1 

1  .484 

.580 

1.84 

14 

7 

6.8 

3 

SOR  1 

■ 

.596 

.636 

2.21 

12 

2 

7.0 

3  +  23  3  +  9 

XX  X  y  yy 

2 

SOR  1 

ffl 

.  62 

.699 

2.79 

5.2 

LSOR, ADLR 

■ 

,447 

.547 

1.66 

3.1 

2 

SOR  1 

.847 

6.04 

B 

'  11.7. 

1 

.798 

4.43 

In 

6.5 

WSD  1.552,  .353 

;  .638 

2.22 

4.1 

1,4  ,  ,353 

1.03 

div. 

div. 

div. 

WSDA  1,552,  .353 

.549 

.638 

2.22 

4.1 

NAVIER  -  STOKES 

CSOR 

■ 

■■ 

o 

II 

2 

downs tr.  1,  .5 

.846 

5.98 

j^B 

11.0 

any 

1r  .5 

■ 

.846 

5.98 

Wm 

11.0 

100 

1.1,  .5 

div. 

div. 

i9 

div. 

100 

.8,  .5 

■ 

.93 

.947 

18.4 

EH 

34.0 

10 

upstream  1,  .5 

■ 

.884 

.912 

10.8 

33 

16 

20.0 

100 

1,  .5 

■ 

.994 

.995 

220. 

33 

16 

100. 

100 

.8,  .5 

■ 

.984 

.988 

83, 

33 

16 

L50. 

0 

3 

■ 

.845 

.863 

6.79 

33 

8 

10.7 

any 

llllllliillllll^^ 

■ 

.845 

.863 

6.79 

60 

25 

10,7 

10 

■ 

.874 

.889 

8.49 

60 

25 

100 

HHHEBE 

■ 

.989 

.990 

100. 

60 

25  ] 

gB|| 
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TABLE  1. 


(Cont'd.  Here  d=2,  p=l:2) 


Relax. 

Scheme 

y 

3  +  e9  ,  e«l 

XX  yy 

SOR,  xLSOR  any 

1-  0(e) 

a3  +  c3 

XX  yy 

(q  =  inin(^  ,  j) ) 

yLSOR  1 

ADLR 

-  a+2c’ 

SD,  yLSD,  ADLSD  1 

SD  (2q+2)/(3q+2) 

yLSD  (2a+2c)/(2a+3c) 

ADLSD  2/3,  2/3 

1 

(q+2)/(3q+2) 

(2a+c)/(2a+3c) 

=  .577 

A,  -  5-  3 

h  h  X 

yLSOR  1 

A,  -  ^  a” 

h  h  X 

(n>o) 

yLSOR+  1 

yLSOR- 

yLSORs 

max  ,  [5+6ri+2Ti  ]  ^  ^ 

,1  1  i+n  1 V 

max  (3  ,l2+n+i 

£  =  .577 

Navier  -  Stokes 
with  large  Rh  in 

2  or  3  dimensions 

SOR  (pressure 
corrected  by  the 
continuity  equation) , 
downstream  or  up¬ 
stream,  with  any 
relaxation  parameters. 

x (y) direction. yLSOR+, yLSOR- and yLSORs indicate, respectively,
relaxation marching forward, backward and symmetrically (alternately
forward and backward). CSOR means Collective SOR (see Sec. 3 in [3]) and the
attached $\omega$'s are for the velocity components and for the pressure.
ADLR denotes Alternating Direction Line Relaxation (a sweep of xLSOR
followed by a sweep of yLSOR). SD is Simultaneous Displacement (Jacobi)
relaxation, WSD is Weighted Simultaneous Displacement with the optimal
weights as specified in Sec. 3.3 above (and with other weights, to show the
sensitivity). WSDA (for $\Delta_h^2$) is like WSD, except that residuals are
computed in fewer operations by making first a special pass that computes
$\Delta_h u$. yLSD is y-lines relaxation with simultaneous displacement,
ADLSD is the corresponding alternating-direction (yLSD alternating with
xLSD) scheme.

The next columns list $\hat\rho = h_{k+1} : h_k$ (see discussion below),
the smoothing rate $\bar\mu$ as defined by (3.8), and the multi-grid
convergence rate $\mathring\mu$, calculated by (6.2). We also list
$|\log\mathring\mu|^{-1}$, which is the theoretical number of relaxation
Work Units required to reduce the error by the factor e, and $W_M$, the
overall multi-grid computational work (see Sec. 6.3). To make comparisons
of different schemes possible, we also list, for each case, the number of
operations per grid point per sweep. This number times n (the number of
points on $G^M$) gives the number of operations in a Work Unit. We list
only the basic number of additions and multiplications (counting shifts
as multiplications), thus ignoring the operations of transferring
information, indexing, etc., which may add up to a significant amount of
operations, but which are too computer- and program-dependent to be
specified. Also, we assumed that the right-hand sides $f^k$, including
$f^M$, are stored in the most efficient form (e.g., $h^2 f$ is actually
stored). Note that the SOR operation count is smaller for $\omega=1$
(Gauss-Seidel) than for any other $\omega$.

Numbers in this table were calculated by Allan S. Goodman, at IBM Thomas
J. Watson Research Center. A more extensive list is in preparation.

Mesh-size ratio optimization. Examining Table 1, and many other unlisted
examples, it is evident that the mesh-size ratio $\hat\rho = 1:2$ is close
to optimal, yielding almost minimal $|\log\mathring\mu|^{-1}$ and minimal
$W_M$. This ratio is more convenient and more economic in the
interpolation processes (which are ignored in the above calculations) than
any other efficient ratio. In practice, therefore, the ratio
$\hat\rho = 1:2$ should always be used, giving also a very desirable
standardization.

6.3. Over-All Multi-Grid Computational Work. Denote by $W_M$ the
computational work (in the above Work Units) required to solve the $G^M$
problem ((2.2), $k=M$) to the level of its truncation errors $\tau^M$ (cf.
Sec. A.8). If the problem is first solved on $G^{M-1}$ to the level
$\tau^{M-1}$, and if the correct order of interpolation is used to
interpolate the solution to $G^M$ (so that unnecessary high-frequencies are
not excited; cf. Sec. A.2, and in particular (A.7) for i=1) then the
residuals of this first $G^M$ approximation are $O(\tau^{M-1})$. The
computational work required to reduce them to $O(\tau^M)$ is
$\log O(\tau^M/\tau^{M-1}) / \log\mathring\mu$. Hence,

(6.3)    $W_M = W_{M-1} + \log O(\tau^M/\tau^{M-1}) / \log\mathring\mu$.

Similarly, we can solve the $G^{M-j}$ problem expending work

(6.4)    $W_{M-j} = W_{M-j-1} + \hat\rho^{jd} \log O(\tau^{M-j}/\tau^{M-j-1}) / \log\mathring\mu$

(since a $G^{M-j}$ work unit is $\hat\rho^{jd}$ times the $G^M$ unit). If we
use p-order approximation, then

(6.5)    $O(\tau^k/\tau^{k-1}) = O(h_k^p / h_{k-1}^p) = O(\hat\rho^p)$.

Hence, using (6.4) for $j = 0, 1, 2, \ldots, M-1$ and neglecting $W_0$,

         $W_M \le (1 + \hat\rho^d + \hat\rho^{2d} + \cdots) \, p \log\hat\rho / \log\mathring\mu$,

or, by (6.2),

(6.6)    $W_M \le \frac{p \log\hat\rho}{(1 - \hat\rho^d)^2 \log\bar\mu}$.

(The same p was assumed in computing the first approximation and in the
improvement cycles. This of course is not necessary.)
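
The closed-form bound at the end of this derivation can be evaluated directly. The Python sketch below (the function name is hypothetical) computes $p \log\hat\rho / ((1-\hat\rho^d)^2 \log\bar\mu)$; for second-order approximation, ratio 1:2, two dimensions and smoothing rate .500 it gives about 3.56 Work Units, in line with the value 3.5 listed for that case in Table 1.

```python
import math


def overall_work(p, rho, d, mu_bar):
    """Bound on the Work Units needed to solve to truncation-error level.

    p      approximation order
    rho    mesh-size ratio h_{k+1}/h_k (e.g. 0.5)
    d      number of space dimensions
    mu_bar smoothing rate of the relaxation scheme
    """
    return p * math.log(rho) / ((1.0 - rho ** d) ** 2 * math.log(mu_bar))
```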

Typical values of this theoretical $W_M$ are shown in Table 1 above. In
actual computations a couple of extra Work Units are expended in solving
a problem, because we cannot make non-integral numbers of sweeps and
MG cycles, and also because we usually solve to accuracy below the level of
the truncation errors.


For 5-point Poisson problems, for example, the following procedure gives
a $G^M$ solution with residuals smaller than $\tau^M$: (i) Obtain $u^{M-1}$
on $G^{M-1}$ with residuals smaller than $\tau^{M-1}$. (ii) Starting with
the cubic interpolation $u^M = I_{M-1}^M u^{M-1}$ (preferably by using the
difference operator itself; cf. [7]), make a MG correction cycle such as
Cycle C with $\eta = 0$ (i.e., switching to $G^{k-1}$ after two sweeps on
$G^k$), with $I_k^{k-1}$ transfer by injection (cf. Sec. A.4) and
$I_{k-1}^k$ by linear interpolation, and with "convergence" on $G^k$
defined as obtained after the first sweep following a return from
$G^{k-1}$. A precise count shows Step (ii) to require $30n + O(n^{1/2})$
operations, where n is the number of points in $G^M$. Thus, the total
number of operations is

         $(1 + \frac14 + \frac1{16} + \cdots) 30n + O(n^{1/2}) = 40n + O(n^{1/2})$.

Incidentally, none of these operations is a full multiplication; only
additions and shifts (multiplications or divisions by 2 or 4) are used.

The theoretical $W_M$ for this problem (sixth line in Table 1) amounts to
only 17.5n operations, since it ignores interpolation work (10.5n
operations in the above procedure) and allows non-integral numbers of
sweeps and cycles.

In fact, numerical tests showed the above algorithm to yield residuals
considerably below the truncation errors. (The only cases in which the
residuals approached 50% of the truncation errors were cases with high
smoothness, in which the correct MLAT discretization would be different;
namely, of higher order. Cf. Sec. 8 and the remark following formula (A.7).)
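
The two operation counts quoted above follow from short arithmetic, sketched here in Python with hypothetical function names. The first function sums the per-level cost of Step (ii) over all grids (a geometric series with ratio $\hat\rho^d = 1/4$); the second multiplies a work estimate in Work Units by the number of operations per point per sweep (4 additions and 1 multiplication for Gauss-Seidel on the 5-point operator, as listed in Table 1).

```python
def total_ops_per_point(step_ops=30.0, rho=0.5, d=2):
    """Operations per finest-grid point when each level costs step_ops per point."""
    return step_ops / (1.0 - rho ** d)


def theoretical_ops_per_point(work_units=3.5, ops_per_sweep=5):
    """Operations per finest-grid point implied by a work estimate in Work Units."""
    return work_units * ops_per_sweep
```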


6.4. Numerical Experiments; Elliptic Problems. A typical numerical
experiment is shown in Appendix B, including the FORTRAN program and the
computer output. The output shows a multi-grid convergence rate

         $\mathring\mu = (.009051 / 28.1)^{1/[\ldots]} = .537$,

which is close to, and slightly faster than, the theoretical value
$\mathring\mu = .595$ shown in Table 1.


Many numerical experiments with various elliptic difference equations
in various domains were carried out at the Weizmann Institute in 1970-1972,
with the collaboration of Y. Shiftan and N. Diner. Some representative
results were reported in [2], and many others in [11]. These experiments
were made with other variants of the multi-grid algorithm (variants A and
B), but their convergence rates agree with the same theoretical rates
$\mathring\mu$.

The experiments with equations of the form $aU_{xx} + cU_{yy}$ with
$a \gg c$ showed poor convergence rates, since the relaxation scheme used
was Gauss-Seidel, and not the appropriate line relaxation (cf. Sec. 3.1).
Some of these rates were better than predicted by the mode analysis,
because the grids were not big enough to show the worst behavior. The
convergence rates found in the experiments with the biharmonic equation
were also rather poor (although nicely bounded away from 1, independently
of the grid size), again because we used Gauss-Seidel relaxations and
injections instead of the appropriate schemes (cf. Sec. 3.3 and A.4). All
these points were later clarified by mode analyses, which fully explain all
the experimental results. In solving the stationary Navier-Stokes
equations, as reported in [2], SOR instead of CSOR was employed (cf. Table 1
above), and an additional over-simplification was made by using, in each
multi-grid cycle, values of the nonlinear terms from the previous cycle,
instead of using the FAS scheme (Sec. 5).

Nevertheless, these experiments did clearly demonstrate important
features of the multi-grid method: The rate of convergence was essentially
insensitive to several factors, including the shape of the domain
$\Omega$, the right-hand side F (which has some influence only at the first
couple of cycles; cf. Sec. A.2) and the finest mesh-size $h_M$ (except for
mild variations when $h_M$ is large). The experiments indicated that the
order I of the interpolations $I_k^{k+1}$ should be the order of the
elliptic equation, as shown in Sec. A.2 below. (Note that in [2] the order
was defined as the degree of the polynomial used in the interpolation,
whereas here I is that degree plus 1.)


More numerical experiments are now being conducted at the Weizmann
Institute in Israel and at IBM Research Center in New York, and will
be reported elsewhere. We will briefly report here only an extreme case
of the multi-grid tests: the solution of transonic flow problems.

6.5. Numerical Experiments; Transonic Flow Problems. These experiments
were conducted in 1974 at the Weizmann Institute with J.L. Fuchs,
and recently at the NASA Langley Research Center in collaboration with
Dr. Jerry South while the present author was visiting the Institute
for Computer Applications in Science and Engineering (ICASE). They are
preliminarily reported in [12], and will be further reported elsewhere.
One purpose of this work was to examine the performance of the multi-grid
method in a problem that is not only nonlinear, but, more significantly,
is also of mixed (elliptic-hyperbolic) type and contains discontinuities
(shocks).

We considered the transonic small-disturbance equation in conservation
form

(6.7)    $((K - \kappa\phi_x)\phi_x)_x + c\,\phi_{yy} = 0$,


for the velocity disturbance potential $\phi(x,y)$ outside an airfoil. Here

         $K = (1 - M_\infty^2)/\tau^{2/3}$,    $\kappa = \frac12(\gamma+1) M_\infty^2$,

$M_\infty$ is the free-stream Mach number, and $\gamma = 1.4$ is the ratio
of specific heats. $\tau$ is the airfoil thickness ratio, assumed to be
small. $c = 1$, unless the y coordinate is stretched. The airfoil, in
suitably scaled coordinates, is located at $\{y = 0, |x| \le \frac12\}$,
and we consider nonlifting flows, so that the problem domain can, by
symmetry, be reduced to the half-plane $\{y > 0\}$, with boundary conditions

(6.8)    $\phi(x,y) \to 0$  as  $x^2 + y^2 \to \infty$,

(6.9)    $\phi_y(x,0) = 0$  for  $|x| > \frac12$,
         $\phi_y(x,0) = F'(x)$  for  $|x| \le \frac12$,

where $\tau F(x)$ is the airfoil thickness function which we took to be
parabolic. Equation (6.7) is of hyperbolic or elliptic type depending
on whether $K - 2\kappa\phi_x$ is negative or positive (supersonic or
subsonic).

The difference equations we used were essentially Murman's
conservative scheme ([9]; for a recent account of solution methods, see
[8]), where the main idea is to adaptively use upstream differencing in the
hyperbolic region and central differencing in the elliptic region, keeping
the system conservative. For relaxation we used vertical (y) line
relaxation, marching in the stream direction. The multi-grid solution
was programmed both in the CS (Sec. 4) and the FAS (Sec. 5) modes,
with practically the same results. We used cubic interpolation for
$I_k^{k+1}$ and injection for $I_{k+1}^k$.

Local mode analysis of the linearized, frozen difference equations
and vertical-forward line relaxation gives the smoothing rate

(6.10)   $\bar\mu = \max\left\{ \left| \frac{b_+}{b_- + b_+ + i b_-} \right| , \; \frac{b_+}{2c + b_+} \right\}$,    $b_\pm(x) = K - 2\kappa\phi_x(x \pm h/2)$,

at elliptic (subsonic) points, and $\bar\mu = 0$ at supersonic points. We
were interested in cases where $K < 1$ and $\phi_x \ge 0$, and hence, in
smooth elliptic regions ($b_+ \approx b_-$) without coordinate stretching
we get $\bar\mu \approx 1/|2+i| = 0.45$ and
$\mathring\mu = \bar\mu^{3/4} = 0.55$.
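
The smoothing rate quoted here can be checked by a brute-force local mode analysis. The Python sketch below is an illustration under an assumed frozen stencil, not the scheme of [9]: the x-differences carry constant coefficients `b_plus` (ahead of the point) and `b_minus` (behind it), the y-differences carry `c`, and each vertical line is relaxed simultaneously while marching in the +x direction. It scans the high-frequency modes (those with $\max(|\theta_1|, |\theta_2|) \ge \pi/2$ in mesh units, for ratio 1:2) and returns the largest amplification factor. The function name is hypothetical.

```python
import cmath
import math


def line_relaxation_smoothing(b_minus, b_plus, c, steps=16):
    """Largest amplification factor over the high-frequency modes."""
    worst = 0.0
    for i in range(-steps, steps + 1):
        for j in range(-steps, steps + 1):
            if 2 * max(abs(i), abs(j)) < steps:
                continue                    # low frequency: left to coarser grids
            t1 = math.pi * i / steps
            t2 = math.pi * j / steps
            # New values on the current and previous lines, old value ahead.
            den = (b_minus + b_plus + 2.0 * c * (1.0 - math.cos(t2))
                   - b_minus * cmath.exp(-1j * t1))
            worst = max(worst, abs(b_plus / den))
    return worst
```

With equal coefficients the scan returns $1/|2+i| \approx 0.447$, attained at $\theta = (\pi/2, 0)$, and raising it to the power 3/4 gives about 0.55. Making `b_minus` or `c` small drives the result toward 1, which is the behavior described for the region behind a shock and for stretched coordinates.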

The actual convergence rates, observed in our experiments with
moderately supercritical flows ($M_\infty = 0.7$ and $M_\infty = 0.85$,
$\tau = 0.1$) on a 64x32 grid, were $\mathring\mu = 0.52$ to 0.53, just
slightly faster than the theoretical value. (See detailed output in [12].
The work count in [12] is slightly different, counting also the work in
the transition.)

For highly supercritical flows ($M_\infty = 0.95$, $\tau = 0.1$) the MG
convergence rate deteriorated, although it was still 3 times faster than
solution by line relaxation alone. The worse convergence pattern is
explainable in terms of the mode analysis for the elliptic region
immediately behind the shock, where $b_+ \gg b_-$, yielding $\bar\mu$
closer to 1. Also, the fast changes in $\phi_x$ in that region give
$\sigma > 1$ (see Sec. A.1), i.e., the coarse-grid cycles actually magnify
the Fourier component with $\theta = (\pi/2, 0)$, the same component for
which $\bar\mu$ is closer to 1. This worse behavior in this restricted
region further affected our computations because we did not use separate
stopping tests for this region as we should (see Sec. A.6). A correct
multi-grid algorithm for this problem should, therefore, include
symmetric selective line relaxation (see Sec. 3.2), or partial relaxation
sweeps (see Sec. A.9), or both, in addition to residual weighting
(Sec. A.4).

Coordinate stretching, which transforms the bounded computational
domain to the full half plane, gave difference equations that again
exhibited slow multi-grid convergence rate. This, too, is explainable
by the mode analysis. For example, in the regions where the y coordinate
is highly stretched, c in (6.7) becomes very small and hence $\bar\mu$ in
(6.10) is close to 1. The theoretical remedies: alternating-direction line
relaxations and partial relaxation sweeps. The latter was tried in one
simple situation (stretching only the x coordinate), and indeed restored
the convergence rate of the corresponding unstretched case.


7.  NON-UNIFORM  GRIDS. 

Many  problems  require  very  different  resolution  in  different  parts  of 
their  domains.  Special  refinement  of  the  grid  is  required  near  singular 
points,  in  boundary  layers,  near  shocks,  and  so  on.  Coarse  grids  (with 
higher  approximation  order)  should  be  used  where  the  solution  is  smooth, 
or  in  subdomains  far  from  the  region  where  the  solution  is  accurately 
needed, etc. A general method for locally choosing mesh-sizes and
approximation orders is described in Sec. 8. An important feature of the
method is adaptivity: the grid may change during the solution process,
adapting itself to the evolving solution. In this section, we describe a
method of organizing non-uniform grids so that the [...]. The main idea is
to let the sequence of uniform grids $G^0, G^1, \ldots, G^M$ (cf. Sec. 2)
be open-ended and to produce higher local refinement [...]. The multi-grid
process remains practically as before (Sec. 5), with similar efficiency.
[...] substantially reduced storage requirement.

7.1. Organizing Non-Uniform Grids. How are general non-uniform grids
organized for actual computations? [...] One approach, usually used with
the finite element method, is to keep the entire system very flexible,
allowing each grid point to be practically anywhere. This requires a great
deal of bookkeeping: grid-point locations and [...]. It is also difficult
to have a multi-grid solution on a completely general grid (see, however,
Secs. 7.3 and A.5), and complete generality is not necessary for obtaining
any desired refinement pattern.

Another approach for organizing a non-uniform grid is by a coordinate
transformation, with a uniform grid being used on the transformed domain.
On such grids, topologically still rectangular, the multi-grid method
[...] (switching criteria, residual weighting, [...]). [...] produce,
unless it is a one-dimensional [...]. [...] difficulties are enlarged
[...] does become sophisticated (except for one-dimensional
transformations), and in particular if higher-order approximations should
be used in some or all subdomains.

Be it in the original or in some transformed domain, one would like to
have a convenient system for local refinement, with [...] bookkeeping and
efficient methods for formulating difference equations. The following
system is proposed (and then generalized in Secs. 7.3, 7.4):
A non-uniform grid is a union of uniform sub-grids $G^0, G^1, \ldots, G^M$,
with corresponding mesh-sizes $h_0, h_1, \ldots, h_M$. Usually
$h_k = 2h_{k+1}$ and every other grid line of $G^{k+1}$ is a grid line of
$G^k$. Unlike the description in Sec. 2, however, the sub-grids are not
necessarily extended over the same domain. The domain of $G^{k+1}$ may be
only part of the domain of $G^k$ (but not vice versa). Thus we may have
different levels of refinement at different subdomains.

For problems on a bounded domain, several of the first (the coarsest)
sub-grids may extend to the entire domain $\Omega$. That is, they do not
serve to produce different levels of refinement, but they are kept in
the system to serve in the multi-grid process of solving the difference
equations. $G^0$ should be coarse enough to have its system of difference
equations relatively inexpensive to solve (i.e., requiring less than
$O(\sum n_k)$ operations, where $n_k$ is the number of grid points in
$G^k$. But cf. Sec. 4.1). The finer sub-grids typically extend only over
certain subdomains of $\Omega$, not necessarily connected. Generally, $G^k$
is stretched over those subdomains where the desired mesh-size is $h_k$
or less. Thus, very fine levels (e.g., with M=20, so that
$h_M = 2^{-20} h_0$) may be introduced, provided they are limited to
suitably small subdomains.

Such a system is very flexible, since grid refinement (or coarsening)
is done by extending (or contracting) uniform sub-grids. There are several
possible ways of storing functions on a (possibly disconnected) uniform
grid, allowing for easy grid changes. For example, each string (i.e.,
connected row or column) of function values can be stored separately, at an
arbitrary place in one big storage area, with a certain system of pointers
leading from one string to the next. The extra storage area needed for
these pointers is small compared with the area needed for storing the
function values themselves. One such system, with subroutines for
creating, changing and interpolating between the grids, is now under
construction, and will be reported elsewhere.
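
The string-and-pointer storage described above can be sketched as follows. This is only an illustrative sketch, not the system under construction mentioned in the text: the class name `StringStore`, the five-entry header layout and the linear search along the pointer chain are all assumptions made for the example.

```python
class StringStore:
    """One big storage area holding separate 'strings' (connected rows) of values."""

    def __init__(self):
        self.values = []    # the single storage area for function values
        self.headers = []   # per string: [row, first_col, offset, length, next_index]

    def add_string(self, row, first_col, data):
        """Append one connected row of values and link the previous string to it."""
        index = len(self.headers)
        self.headers.append([row, first_col, len(self.values), len(data), None])
        if index > 0:
            self.headers[index - 1][4] = index   # pointer from one string to the next
        self.values.extend(data)
        return index

    def get(self, row, col):
        """Follow the pointer chain until a string covering (row, col) is found."""
        index = 0 if self.headers else None
        while index is not None:
            r, c0, offset, length, nxt = self.headers[index]
            if r == row and c0 <= col < c0 + length:
                return self.values[offset + (col - c0)]
            index = nxt
        raise KeyError((row, col))

    def pointer_overhead(self):
        """Bookkeeping entries divided by the number of stored function values."""
        return 5 * len(self.headers) / max(1, len(self.values))


store = StringStore()
store.add_string(row=0, first_col=0, data=[0.0, 0.1, 0.2, 0.3])
store.add_string(row=0, first_col=10, data=[1.0, 1.1])   # disconnected piece of row 0
store.add_string(row=1, first_col=2, data=[2.0] * 50)
```

Extending a sub-grid then amounts to appending further strings; the overhead ratio falls as the strings get longer, which is the sense in which the pointer area is small compared with the value area.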

If the (original or transformed) problem's domain is unbounded, we
usually put suitable boundary conditions on some finite, "far enough"
artificial boundary. In the present system, we do not have to decide
in advance where to place the artificial boundary: we can extend (or
contract) the coarsest sub-grid(s) as the solution evolves. Moreover,
we can add increasingly coarser levels ($G^{-1}, G^{-2}, \ldots$) to cover
increasingly wider domains, if required by the evolving solution. In this
way, we may reach computational domains of large diameter R by adding only
O(log R) grid points (assuming the desired mesh-size, out at distance r, is
proportional to r, or larger; this should usually be the case, especially
if appropriate higher-order approximations are used at large distances).
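
The O(log R) count can be checked with a small sketch. It assumes, purely for illustration, that each added coarser level doubles both the mesh-size and the half-width of the box it covers, so every level holds the same number of points; the function name and the default values of `R0`, `h0` and `dim` are hypothetical.

```python
import math

def added_points(R, R0=1.0, h0=0.1, dim=2):
    """Count grid points added by coarser levels until a box of half-width R is covered.

    Level j (j = 1, 2, ...) has mesh-size 2**j * h0 and covers the box of
    half-width 2**j * R0, so each level stores the same number of points.
    Points overlapped by finer levels are counted too, as they are stored.
    """
    per_level = (round(2 * R0 / h0) + 1) ** dim     # points in one level's box
    levels = max(0, math.ceil(math.log2(R / R0)))   # doublings needed to reach R
    return levels, levels * per_level
```

Doubling R adds exactly one more level, hence a fixed number of points, which is the logarithmic growth claimed above.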

There appears to be a certain waste in the proposed system, as one
function value may be stored several times, when its grid point belongs
to several levels $G^k$. This is not the case. First, the amount
of such extra storage is small (less than $2^{-d}$ of the total storage; see
(4.6)). Moreover, the stored values are exactly those needed for the
multi-grid process of solution. In fact, in that process, the values
stored for different levels at the same grid-point are not identical;
they only converge to the same value as the process proceeds.
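
The storage bound can be made plausible by a short count. The following is a sketch under assumptions not stated in the text: the bound is read as $2^{-d}$ with d the dimension, the mesh ratio is 2:1, boundary effects are ignored, and $n_k$ denotes the number of points of $G^k$. About one in every $2^d$ points of $G^{k+1}$ then coincides with a point of $G^k$, so:

```latex
% Assumptions: dimension d, h_k = 2 h_{k+1}, boundary effects ignored,
% n_k = number of grid points in G^k.
\begin{align*}
\text{duplicated values}
  &\approx \sum_{k=0}^{M-1} 2^{-d}\, n_{k+1}
   = 2^{-d} \sum_{k=1}^{M} n_k \\
  &< 2^{-d} \sum_{k=0}^{M} n_k
   = 2^{-d} \times (\text{total storage}).
\end{align*}
```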
7.2. The Multi-Grid Algorithm on Non-Uniform Grids. The following
is a description of the modifications in the FAS multi-grid algorithm
(Sec. 5) in the case of a non-uniform grid with the above structure. The
algorithm remains almost the same, except that the difference equations
are changed to take account of the fact that the levels $G^k$ do not
necessarily cover the same domain. Denoting by $G^k_m$ the set of points of
$G^k$ which are inner points³ of $G^m$ (i.e., points where $G^m$
difference equations are defined; see Figure 4), the modified form of
the difference equations on $G^k$ is

(7.1)  $L^k U^k = \bar F^k, \qquad \Lambda^k U^k = \bar\Phi^k,$

where

(7.2)  $\bar F^k = F^k$ and $\bar\Phi^k = \Phi^k$ in $G^k - G^k_{k+1}$ and for $k = M$,

(7.3)  $\bar F^k = \bar F^k_{k+1}$ and $\bar\Phi^k = \bar\Phi^k_{k+1}$ in $G^k_{k+1}$,

(7.4)  $\bar F^k_m = I^k_m(\bar F^m - L^m u^m) + L^k(I^k_m u^m),$

(7.5)  $\bar\Phi^k_m = I^k_m(\bar\Phi^m - \Lambda^m u^m) + \Lambda^k(I^k_m u^m).$

$F^k$ and $\Phi^k$, as in Sec. 2, are the $G^k$ approximation to the
original right-hand sides $F$ and $\Phi$, respectively.

Observe that, by (7.2-3), each intermediate level $G^k$ plays a double
role: on the subdomain where finer sub-grids are not defined, $G^k$ plays
the role of the finest grid, and the difference equation there is an
approximation to the original differential equation; on the subdomain
where finer sub-grids are present, $G^k$ serves for calculating the
coarse-grid correction. These two roles are not confused owing to the FAS
mode, in which the correction equations are written in terms of the full
approximation. In other words, $\bar F^k_m$ may be regarded as the usual
$G^k$ right-hand side ($F^k$), corrected to achieve $G^m$ accuracy in the
$G^k$ solution. Indeed,

(7.6)  $\bar F^k_m - I^k_m \bar F^m = L^k(I^k_m u^m) - I^k_m(L^m u^m),$

which is the $G^m$ approximation to the $G^k$ truncation error.

³ We use the term "inner" and not "interior", because these points may
well be boundary points. Indeed, at boundary points difference equations
are defined, although they are of a special type, called boundary
conditions. The only $G^m$ points where $G^m$ equations are not defined are
points on or near the internal boundary of $G^m$, i.e., the boundary
beyond which the level $G^m$ is not defined but coarser levels are. If
grid lines of $G^m$ do not coincide with grid lines of $G^k$, $G^k_m$ is
defined as the set of points of $G^k$ to which proper interpolation from
inner points of $G^m$ is well-defined. For $m>M$, $G^k_m$ is empty.
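
The effect of the corrected right-hand side (7.4) can be illustrated on a one-dimensional model problem. The sketch below is a minimal illustration under assumptions of its own: the operator is the 3-point approximation to $-d^2/dx^2$ with zero boundary values, the fine grid covers the whole interval, and the transfer $I^k_m$ is taken to be simple injection. All function names are hypothetical.

```python
import math

def apply_L(u, h):
    """3-point approximation to -u'' at interior points (u[0], u[-1] are boundary values)."""
    return [(-u[i - 1] + 2 * u[i] - u[i + 1]) / h**2 for i in range(1, len(u) - 1)]

def solve_L(f, h):
    """Solve L u = f with zero boundary values by the Thomas algorithm."""
    n = len(f)
    rhs = [fi * h**2 for fi in f]
    diag = [2.0] * n
    for i in range(1, n):                  # forward elimination (off-diagonals are -1)
        m = -1.0 / diag[i - 1]
        diag[i] += m
        rhs[i] -= m * rhs[i - 1]
    x = [0.0] * n
    x[-1] = rhs[-1] / diag[-1]
    for i in range(n - 2, -1, -1):         # back substitution
        x[i] = (rhs[i] + x[i + 1]) / diag[i]
    return [0.0] + x + [0.0]

def fas_rhs(F_fine, u_fine, h_fine):
    """Coarse right-hand side I(F - L u) + L_coarse(I u), with I = injection."""
    residual = [F - Lu for F, Lu in zip(F_fine, apply_L(u_fine, h_fine))]
    n = len(u_fine) - 1
    # residual[i] sits at fine point i+1; coarse interior point j is fine point 2j
    restricted = [residual[2 * j - 1] for j in range(1, n // 2)]
    coarse_L = apply_L(u_fine[::2], 2 * h_fine)
    return [r + Lc for r, Lc in zip(restricted, coarse_L)]

n = 16
h = 1.0 / n
F_fine = [math.pi**2 * math.sin(math.pi * i * h) for i in range(1, n)]
u_fine = solve_L(F_fine, h)                 # converged fine-grid solution
F_bar = fas_rhs(F_fine, u_fine, h)          # corrected coarse right-hand side
u_corrected = solve_L(F_bar, 2 * h)         # coarse solve with the correction
u_plain = solve_L(F_fine[1::2], 2 * h)      # coarse solve without it
err_corrected = max(abs(a - b) for a, b in zip(u_corrected, u_fine[::2]))
err_plain = max(abs(a - b) for a, b in zip(u_plain, u_fine[::2]))
```

With the corrected right-hand side the coarse-grid solution reproduces the fine-grid values at the coarse points up to rounding, whereas the uncorrected coarse problem carries its own, larger truncation error.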
Figure 4. Example of Non-Uniform Grid.

A section of the domain $\Omega$ and its boundary $\partial\Omega$ is shown,
covered with a coarser grid $G^k$ (line intersections) and a
finer grid $G^{k+1}$ (crosses and circles). For the case of
5-point (or 9-point "box") difference equations, the
inner points of $G^{k+1}$ are marked with crosses, its outer points with
circles. (For convenient interpolation, outer points should
lie on $G^k$ lines.) At outer points belonging to $G^k$, the
converged solution satisfies the $G^k$ difference equations,
such as the 5-point relations indicated by squares. At
other outer points, such as those shown with triangles, the
solution is always an interpolation from values at adjacent
$G^k$ points. (Note that starting values at outer points should
be such that these interpolation relations are satisfied. The
FAS interpolation steps will then automatically preserve these
relations.)
The only other modification required in applying Cycle C
to non-uniform grids is in the convergence switching criteria.
See Sec. A.10.


When converged, the solution so obtained satisfies equations (2.2)
in the inner part of $G^k - G^k_{k+1}$ (k=0,1,...,M). On outer (i.e.,
non-inner) points the solution automatically satisfies either a coarser
difference equation (if the point belongs to a coarser grid) or a coarser
interpolation relation. See Figure 4. Note that in this system
difference equations should be defined on uniform grids only. This is an
important advantage. Difference equations on uniform grids are much
simpler and more accurate. The basic weights (e.g., the weights (1,-2,1)
for the second-order approximation to $\partial^2/\partial x^2$) can be read
from small standard tables; whereas on a general grid the weights
should be recomputed (or stored) separately for each point, and they are
very complicated for high-order approximations.


Another advantage is that the relaxation sweeps, too, are made on uniform
grids only. This simplifies the sweeping, and is particularly important
where symmetric and alternating-direction sweeps are required (cf. Sec. 3).

Numerical experiments indicate that the typical multi-grid convergence
rates, measured by [...] multi-grid [...]. [...] make up only a small part
of the points of the final non-uniform grid.


7.3. Finite-Elements Generalization. The structure and solution process
[...] strictly uniform levels.

Quite often, especially in finite-element discretizations, the "basic"
partition (the coarsest triangulation) of the domain is a non-uniform one,
but one which is particularly suitable for the geometry of the problem.
Finer levels $G^1, G^2, \ldots$ are defined as uniform refinements of that
basic level; e.g., [...] is constant within each basic element.

Having defined the levels in this manner, the rest may in principle be as
[...] only certain, arbitrary portions of [...]. The actual subgrids $G^k$
need not be co-extensive, allowing for [...] refinements. Coarser levels
($G^{-1}, G^{-2}, \ldots$) may be added if the basic level $G^0$ is not
coarse enough for full-speed multi-grid solution ([...] general algorithm
for coarsening a non-uniform $G^0$, and usually [...] is coarse enough).
Data structures similar to the uniform case may be used, but should be
constructed separately for each basic element (or each set of identical
basic elements).


The multi-grid algorithm is the same as in Sec. 7.2. The discrete
equations are thus defined separately for each level. The reproduction
of these equations during relaxation is not as convenient as in the
strictly uniform case, but still, in the interior of any basic
element the equations can readily be read from fixed tables,
one table for each set of identical basic elements.


7.4. Local Transformations. Another important generalization of
the  above  structure  is  to  subgrids  which  are  defined  each  in  terms  of 
another  set  of  variables.  For  example,  near  a  boundary  or  an  interface, 
the  most  effective  local  discretizations  are  made  in  terms  of  local 
coordinates  in  which  the  boundary  (or  interface)  is  a  coordinate  line. 

In  particular,  with  such  coordinates  it  is  easy  to  formulate  high-order 
approximations  near  the  boundary;  or  to  introduce  mesh  sizes  that  are 
different  across  and  along  the  interface  (or  the  boundary  layer) ;  etc. 
Usually  it  is  easy  to  define  suitable  local  coordinates,  and  uniformly 
discretize  them,  but  it  is  more  difficult  to  patch  together  all  these 
local  discretizations. 


A multi-grid method for patching together a collection of local grids
$G^1, G^2, \ldots, G^m$ (each being uniform in its own local coordinates) is
to relate them all to a basic grid $G^0$, which is uniform in the global
coordinates and stretches over the entire domain. The relation is
essentially as above (Sec. 7.2); namely, finite-difference equations are
separately defined in the inner points of each grid, and the FAS multi-grid
process automatically combines them together through its usual
interpolation periods.

A  remark:  To  a  given  collection  of  local  grids  we  may  have  to  add 
intermediate  grids  to  obtain  fast  multi-grid  convergence.  That  is,  if  a 
given local grid $G^k$ is much finer than the basic grid $G^0$, we have to
add increasingly coarser grids, all of them uniform grids in the same
local coordinates, such that the coarsest of them has a mesh size which
is (in the global coordinates) nowhere much smaller than the basic mesh
size $h_0$. Similarly, if the basic global grid $G^0$ is not coarse enough,

the  usual  multi-grid  sequence  of  global  grids  G^,  G^,...,G^  =  G  should 
be  introduced.  Thus,  in  each  set  of  coordinates  we  will  generally  have 
several  grids. 

Such  a  system  offers  much  flexibility.  Precise  treatment  of 
boundaries  and  interfaces  by  the  global  coordinates  is  not  required. 

The  local  coordinates  may  be  changed  in  course  of  computations,  e.g.,  to 
fit  a  moving  interface.  New  sets  of  local  coordinates  may  be  introduced 
(or  deleted)  as  the  need  arises. 

The  data  structure  required  for  creating,  changing  and  employing 
such  grids  is  basically  again  just  any  data  structure  suitable  for 
changeable  uniform  grids.  This,  however,  should  be  supplemented  by 
tables  for  the  local  transformations,  such  that  one  can  efficiently  (i) 
reproduce  the  local  difference  equation,  and  (ii)  interpolate  from  local 
to  global  grid  points,  and  vice-versa. 
7.5. Segmental Refinement. The multi-grid algorithm for non-uniform
grids (Sec. 7.2) can be useful even in the case of uniform grids, if the
computer memory is not sufficiently large to store the finer levels.

"Segmental refinement" is the refinement of one subdomain at a time.
To see why and how this is possible, observe that with the FAS (Sec. 5)
the full solution $u^M$ is obtained on all grids. But on a coarser grid
$G^k$ the $u^M$ solution satisfies a "corrected" difference equation, with
a corrected forcing function $\bar F^k$ replacing $F^k$. It is therefore
not necessary to keep the fine grid, once $\bar F^k$ has been computed.

The corrected forcing function $\bar F^k$ can be computed by segmental
refinement. Refining only one subdomain, one can use the algorithm [...]
obtain [...] refinement and refine a second subdomain. And so on, through a
sequence of subdomains covering the entire domain.

Since subsequent [...] (and few meshes away [...]) [...] only few
neighboring meshes.

With this technique one can operate the multi-grid [...] finest grid.
[...] preliminary (one-dimensional) numerical tests.

L^val!  stor.g.  requirement  can,  in  principle  be  reduced^ 


{1  +  log  I  /  log  J} 

locations,  where  h  is  the  finest  mesh-size  and  R  is  the  diameter  of  the 
domain.  No  external  memory  is  needed. 
8. ADAPTIVE DISCRETIZATION TECHNIQUES.

The previous section described a flexible data structure and solution
process which facilitate implementation of variable mesh-sizes h. The
difference equations in that process are always defined at inner points of
[...], in order to make it easy to employ high and variable approximation
orders. How, then, are mesh-sizes and approximation-orders to be chosen?
Should boundary layers, for example, be resolved by the grid? What is
their proper resolution? Should we use a high order of approximation at
such layers? How can such layers be detected automatically? In this
section we propose a general framework for automatic selection of h and p
in a (nearly) optimal way. In the next (Sec. 9) we will study some special
cases, and show how this proposed system automatically resolves, or avoids
resolving, thin layers, depending on the stated goal of the computations.


8.1. Basic Principles. We will treat the problem of selecting the
discretization parameters h and p (and possibly other parameters, see
Sec. 8.4) as an optimization problem: we will seek to minimize a certain
error estimator E, subject to a given amount of computational work W.
(Or, equivalently, minimize the work W to obtain a given level E of
the error estimator. We will see that the actual control quantity is
neither E nor W, but their rate of exchange.) It is important, however,
to promptly emphasize that we should not take this optimization too
pedantically: it is enough, for instance, to obtain E which is one or two
orders of magnitude larger than the minimum (or, equivalently, to invest
work W which is by some fraction more than theoretically needed. Note
below that [...] is usually proportional to W). Full optimization
is not our purpose, is enormously harder and, in fact, is self-defeating,
since it requires too much computational work to be invested in controlling
h and p. We will aim at having the control work much smaller than the
actual numerical work W, using the optimization problem only as a loose
directive for sensible discretization.

The Error Estimator E is a functional that estimates the overall error
in solving the differential boundary-value problem, in terms of any given
numerical approximation. In principle, such a functional should be
furnished whenever a problem is submitted for numerical solution; in
practice, it is seldom provided. To have such an estimator depends on
having a clear and well-defined idea about the goal of the computations,
i.e., an idea about what error norm we intend to minimize. Given the goal,
even roughly, we can usually formulate E quite easily. We assume that
the numerical approximation $U^h$ is in some suitable neighborhood of the
true solution (this is a necessary and justifiable assumption; see Sec.
8.2), so that E can be written as a linear functional

(8.1)  $E = \int_\Omega G(x)\,\tau(x)\,dx.$

$\tau(x)$ is a local estimate of the truncation error, i.e., the error by
which the numerical solution $U^h$ fails to satisfy the differential
equation $LU = F$; or, more conveniently, the error by which the
differential solution $U$ fails to satisfy the discrete equation
$L^h U^h = F^h$. That is,

(8.2)  $\tau(x) = |LU(x) - L^h U(x)|.$

G(x) is the non-negative "error-weighting function" (or distribution),
through which the goal of the computations should be expressed.

The choice of G can be crude. In fact, multiplying G by a constant
does not change our optimization problem. Also, we can make large
errors, up to one or two orders of magnitude, in the relative values
of G at two different points, since we are content with having E only to a
similar accuracy. What matter are only large changes in G, e.g., near
boundaries. For example, if we have a uniformly elliptic problem of order
m, and if we are interested in computing good approximations to U and its
derivatives up to order $l$ and up to the boundary, then a suitable choice
is

(8.3)  $G(x) = d_x^{\,m/2-l},$

where $d_x$ is the distance of x from the boundary. (The formula should be
suitably modified near a boundary corner.) This and similar choices of
G are easily found by a crude local one-dimensional analysis of the
relation between a perturbation in the equations and the resulting
perturbation in the quantity we wish to approximate. Even though crude,
such a choice of G would specify our goal much more closely than people
usually bother to. Moreover, we can change G if we learn that it fails to
properly weigh a certain region of the computation; it can serve as a
convenient control, conveying our intentions to the numerical
discretization and solution.

The Work Functional W. In solving the discrete equations by the
multi-grid method, the main overall computational work is the number of
Work Units invested in relaxations, times the amount of computations in
each Work Unit (see Sec. 6). If the discretization and relaxation schemes
are suitable, the number of Work Units is almost independent of the
parameters h and p. (See, e.g., [...] in Table 1 above.) Since for our
optimization problem we need W only up to a multiplicative constant, we can
take into account only the amount of computations in a single Work Unit,
i.e., the [...] sweep over the domain. The local number of grid points per
unit volume is $h(x)^{-d}$, and the amount of computation at each grid point
is some function $w(p(x))$, where p(x) is the local order of approximation.
Hence, we can regard the work functional as being

(8.4)  $W = \int_\Omega \frac{w(p(x))}{h(x)^d}\,dx.$

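Global Optimization Equations. Treating the discretization parameters
as spatial variables, h(x) and p(x), the Euler equations of minimizing E
for fixed W are

(8.5a)  $\frac{\partial E}{\partial h(x)} + \lambda\,\frac{\partial W}{\partial h(x)} = 0,$

(8.5b)  $\frac{\partial E}{\partial p(x)} + \lambda\,\frac{\partial W}{\partial p(x)} = 0,$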
where $\lambda$ is a constant (the Lagrange multiplier). It is easily seen
that $\lambda$ is actually the marginal rate of exchange between work and
optimal accuracy, i.e.,

(8.6)  $\lambda = -\frac{dE_{\min}}{dW},$

and the meaning of (8.5) is that we cannot lower E by trading work (e.g.,
by taking smaller h at one point and larger at another, keeping W
constant, or trading a change in h with a change in p).

Equations (8.5) make some essential simplifications in the
optimization problem: they regard h and p as defined at all points
$x \in \Omega$. Also, h and p are assumed to be continuous variables,
whereas in practice they are discrete. (p should be a positive integer, in
some schemes a positive even integer. Values of h are restricted by some
grid-organization considerations.) These simplifications are crucial for
our approach, and they are altogether justified by the fact that we are
content with having only an approximate optimum. The practical aspect, of
choosing permissible h and p close to the solution of (8.5), is discussed in
Sec. 8.3. One restriction we should, however, take into account in
the basic equations, namely, the restriction

(8.7)  $p_0 \le p(x) \le p_1(x).$

Without such a restriction, the optimization equations may give values
of p which cannot be approximated by permissible values. $p_0$ is usually
1 or (in symmetric schemes) 2. The upper bound $p_1$ may express the
highest feasible order due to round-off errors; or the highest order for
which we actually have appropriate (stable) discretization formulae, with
special such restriction near boundaries (hence the possible dependence of
$p_1$ on the position x). With this restriction, Euler equation (8.5b)
should be rewritten as

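(8.8)  $\frac{\partial E}{\partial p(x)} + \lambda\,\frac{\partial W}{\partial p(x)}
        \begin{cases}
          \ge 0 & \text{if } p(x) = p_0, \\
          = 0   & \text{if } p_0 < p(x) < p_1(x), \\
          \le 0 & \text{if } p(x) = p_1(x).
        \end{cases}$

Local Optimization Equations. Substituting (8.1) and (8.4) into
(8.5a) and (8.8), we get the following equations at each point
$x \in \Omega$:

(8.9a)  $G\,\frac{\partial\tau}{\partial h} - \lambda\,\frac{d\,w(p)}{h^{d+1}} = 0,$

(8.9b)  $G\,\frac{\partial\tau}{\partial p} + \lambda\,\frac{w'(p)}{h^{d}}
        \;\substack{\ge \\ = \\ \le}\; 0,$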
where the equality-inequality sign, in (8.9b) and hereinafter, corresponds
to the three cases introduced in (8.8). In principle, the pair of
equations (8.9) determines, for each $x \in \Omega$, the local values of the
pair (h,p), once $\lambda$ is given.

Thus $\lambda$ is our global control parameter. Choosing larger $\lambda$,
we will get an optimized grid with less work and poorer accuracy; lowering
$\lambda$, we invest more work and get higher accuracy. For each $\lambda$,
however, we get (approximately) the highest accuracy for the work invested.
In principle, $\lambda$ should be given by whoever submits the problem for
solution; i.e., the submitter should state at what rate of exchange he is
willing to invest computational work for additional accuracy (see (8.6)).
In practice this is not done, and $\lambda$ usually serves as a convenient
control parameter (see Secs. 8.2 and 8.3).

To compute h and p from (8.9) we should know the behavior of $\tau$ as a
function of h and p. Generally,

(8.10)  $\tau(x,h,p) \approx t(x,p)\,h^p,$

where t(x,p) depends on the equations and on the solution. Since it is
assumed that our numerical approximations [...] upon [...] h and p (see
Sec. 8.3), so that we need to estimate t(x,p) only for
h and p close to the current h(x) and p(x).
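
As an illustration of how (8.9a) and (8.10) fix the local mesh-size once $\lambda$ is given, the sketch below solves for h with p held fixed. It rests on several assumptions: the stationarity condition is taken in the form $G\,\partial\tau/\partial h = \lambda\, d\, w(p)/h^{d+1}$ obtained from (8.1), (8.4) and (8.5a), the work function `w(p) = p` is a placeholder chosen only for the example, and the function name and argument values are hypothetical.

```python
def local_mesh_size(lam, G, t, p, d=2, w=lambda p: float(p)):
    """Mesh-size h satisfying G * d(tau)/dh = lam * d * w(p) / h**(d+1), tau = t * h**p.

    With tau = t * h**p the condition reads G*p*t*h**(p-1) = lam*d*w(p)/h**(d+1),
    so h**(p+d) = lam * d * w(p) / (G * p * t).
    """
    return (lam * d * w(p) / (G * p * t)) ** (1.0 / (p + d))


h_smooth = local_mesh_size(lam=1e-3, G=1.0, t=1.0, p=2)     # mild truncation error
h_layer = local_mesh_size(lam=1e-3, G=1.0, t=1.0e4, p=2)    # large t, e.g. in a thin layer
h_cheap = local_mesh_size(lam=1e-1, G=1.0, t=1.0, p=2)      # larger lambda: less work
```

Under these assumptions a larger local coefficient t, or a larger weight G, gives a finer mesh, while a larger $\lambda$ coarsens the mesh everywhere.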

8.2. Continuation Methods. [...] one problem $P(\gamma)$ for each value
[...], where $P(\gamma_*)$ is the target (the given) problem. [...] in steps
$\delta\gamma$. At each step [...] a first approximation in an iterative
process [...]. The purpose of such continuation procedures is to ensure
that the approximations we use in the iterative process are close enough
to the solution (of the current $P(\gamma)$), so that some [...] are
maintained. Usually $\gamma$ is some [...] number, the Mach number, [...]
in terms of which either the differential equations or the boundary
conditions, or both, are expressed.

The continuation process is not [...]. In many cases, the intermediate
problems are interesting by themselves, since they correspond to [...]
physical problem. [...] discretized problems the continuation process is
not only a method of computing the solution but is, in
effect, the only way to define the solution, i.e., the way to select one
out of the many solutions of the non-linear algebraic system. The desired
solution is defined as the one which is obtained by continuous mapping
from $[\gamma_0, \gamma_*]$ to the solution space with a given solution at
$\gamma_0$ (e.g., the single solution, if $P(\gamma_0)$ is linear). By the
continuation process, we keep every intermediate numerical solution in the
vicinity of a physical solution (to an intermediate problem), hence the
target numerical solution is, hopefully, near the target physical solution,
and is not some spurious solution of the algebraic system. Thus, although
sometimes we may get away without a continuation process (simply because a
starting solution is "close enough", so that the continuation may be done
in just one step), in principle a continuation process must be present in
any numerical solution of non-linear problems. Moreover, such a process is
usually inexpensive, since it can be done with crude accuracy, so
that its intermediate steps usually total less computational work than
the final step of computing an accurate solution to $P(\gamma_*)$.

A continuation process is necessary, in principle, not only for
non-linear problems, but also for linear problems with grid adaptation. In
fact, when h or p are themselves unknown, the discrete problem is
non-linear, even if the differential problem is linear.

In our system, a continuation process with crude accuracy and
little work is automatically obtained by selecting a large value for the
control parameter $\lambda$ (cf. Sec. 8.1). Then, in the final step
($\gamma = \gamma_*$), $\lambda$ is decreased to refine the solution. Thus,
the overall process may be viewed as a multi-grid process of solution,
controlled by the two parameters $\gamma$ and $\lambda$.

The most efficient way of changing $\gamma$ is probably to change it as
soon as possible (e.g., when the multi-grid processing exhibits convergence
to a crude tolerance) and to control the step-size $\delta\gamma$ by some
automatic procedure, so that $\delta\gamma$ is sharply decreased when
divergence is sensed (in the multi-grid processing), and slowly increased
otherwise.
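
The step-size control just described can be sketched as follows. This is a hedged illustration, not a procedure given in the text: `solve_step(previous, trial)` is a hypothetical callback standing for a few crude multi-grid cycles on the trial problem started from the last accepted solution, and the factors `shrink` and `grow` are arbitrary example values.

```python
def continuation(solve_step, gamma0, gamma_target, step0, shrink=0.25, grow=1.5):
    """Advance gamma from gamma0 to gamma_target with an adaptive step-size.

    solve_step(previous, trial) -> bool reports whether the crude solution
    process for the trial value converged when started from the solution
    accepted at the previous value.
    """
    gamma, step, history = gamma0, step0, [gamma0]
    while gamma < gamma_target:
        trial = min(gamma + step, gamma_target)
        if solve_step(gamma, trial):
            gamma = trial
            history.append(gamma)
            step *= grow        # slowly increase after a successful step
        else:
            step *= shrink      # sharply decrease when divergence is sensed
            if step < 1e-12:
                raise RuntimeError("continuation step-size underflow")
    return history


# Toy stand-in for the solver: "converges" only if gamma changes by at most 0.3.
accepted = continuation(lambda prev, trial: trial - prev <= 0.3, 0.0, 2.0, step0=1.0)
```

The toy run starts with a step that is too large, cuts it back sharply, and then lets it grow again until the next rejection.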

In changing $\gamma$ it is advisable to keep the residuals as smooth as
possible, since higher frequency components are more expensive to liquidate
(lower components being liquidated on coarser grids). Thus, for example,
if a boundary condition should be changed while changing $\gamma$, it is
advisable to introduce this change into the system at a stage when the
algorithm is to start working on the coarsest grid.

$\gamma$-Extrapolation. In some cases the given problem
($\gamma = \gamma_*$) is much too difficult to solve, e.g., because the
differential solution fluctuates on a scale too fine to be resolved. In
such cases one is normally not interested in the details of the solution
but rather in a certain functional of the solution. It is sometimes
possible in such cases to solve the problem for certain values of $\gamma$
far from $\gamma_*$, and to extrapolate the corresponding functional values
to $\gamma = \gamma_*$.
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8.3.  Practice  of  Discretization  Control.  The  main  practical  re- 
strictions  imposed  on  the  theoretical  discretization  equations  (8.9)  are 
the  following:  The  approximation  order  p  should  be  a  positive  integer. 

In  many  problems  p  is  also  restricted  to  be  even,  since  odd  orders  are 
less  efficient.  The  mesh-size  function  h(x)  should  be  such  that  a  reasonable 
grid  can  be  constructed  with  it.  Thus,  in  the  ggid  structure  outlined  in 
Sec.  7.1,  h  is  restricted  to  be  of  the  form  h=2  h^,  where  k  is  an  integer. 

Also,  in  the  multi-grid  discretization  method  outlined  in  Sec.  7.2,  any 
uniform  subgrid  truly  influences  the  global  solution  only  if  it  is  large 
enough,  i.e.,  if  at  least  some  of  its  inner  points  belong  also  to  coarser 
grids.  These  discretization  restrictions  will  actually  help  us  in  meeting 
another  practical  requirement,  namely,  the  need  to  keep  the  control-work 
(con^uter  work  invested  in  testing  for  and  affecting  discretization  refor¬ 
mulations)  small  compared  with  the  numerical  work  (relaxation  sweeps  and 
interpolations) . 


The practical adaptive procedure is proposed to be generally along
the following lines:

A. Testing. In the multi-grid solution process (possibly incorporating
a continuation process), at some natural point we get an estimate of the
decrease in the error estimator E introduced by the present discretization
parameters. For example, in FAS Cycle C (see its flowchart in Fig. 2), at
the point where new $F^k$ is computed, the quantity


(8.11)  -AE  =  G  |f^  - 


at each point may serve as a local estimate for the decrease in E per
unit volume (cf. (8.1) and (5.7)), owing to the refinement from $h_k$ to
$h_{k+1}$.

Each such decrease in E is related to some additional work $\Delta W$ (per
unit volume). For example, the refinement from $h_k$ to $h_{k+1}$ requires
the additional work

(8.12)    $\Delta W = \dfrac{w(p)}{h_{k+1}^d} - \dfrac{w(p)}{h_k^d}$    (per unit volume).

Hence we compute the ratio of exchanging accuracy per work,
$Q = -\Delta E / \Delta W$. At regions where this ratio is much bigger than
$\lambda$ (the control exchange rate; cf. Sec. 8.1) we say that the present
parameter ($h_{k+1}$ in the example) is highly profitable and it is worth
trying to further refine the discretization (e.g., introduce there the
subgrid $G^{k+2}$ with $h_{k+2} = h_{k+1}/2$). At regions where Q is much
smaller than $\lambda$ we may coarsen the discretization (abolish the
$G^{k+1}$ subgrid).
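The test above can be sketched in a few lines. The sketch assumes that the added work per unit volume is the difference of $w(p)/h^d$ between the fine and the coarse mesh size, with mesh-size ratio 2; the function names and the safety margin `factor` (standing in for "much bigger" and "much smaller") are hypothetical choices made here for illustration.

```python
def added_work(w_p, h_fine, d, ratio=2.0):
    """Extra work per unit volume when refining from ratio*h_fine to h_fine."""
    h_coarse = ratio * h_fine
    return w_p / h_fine**d - w_p / h_coarse**d


def exchange_rate(minus_delta_e, w_p, h_fine, d):
    """Q = -dE/dW: accuracy gained per unit of added work."""
    return minus_delta_e / added_work(w_p, h_fine, d)


def decide(q, lam, factor=3.0):
    """Classify a point by comparing Q with the control parameter lam.

    `factor` is an assumed margin: refine when Q exceeds factor*lam,
    coarsen when Q falls below lam/factor, otherwise leave the grid alone.
    """
    if q > factor * lam:
        return "refine"
    if q < lam / factor:
        return "coarsen"
    return "keep"
```

Keeping a dead band between the two thresholds avoids switching a subgrid on and off when Q hovers near $\lambda$.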

Extrapolated tests. More sophisticated tests may be based on assuming
the truncation error to have some form of dependence on h and p, such as
(8.10) above. Instead of using $\Delta E$ and $\Delta W$ at the previous
change (from $h_k$ to $h_{k+1}$, in the above example) we can then
anticipate the corresponding values $\Delta\bar{E}$ and $\Delta\bar{W}$ at
the next change (from $h_{k+1}$ to $h_{k+2}$), which are the more
appropriate values in testing whether to make that next change.
Thus, in the above example, assuming (8.10) and
$h_{k+2} = h_{k+1}/2 = h_k/4$, we get $\Delta\bar{E} = 2^{-p} \Delta E$,
$\Delta\bar{W} = 2^d \Delta W$, and hence

(8.13)    $\bar{Q} = -\dfrac{\Delta\bar{E}}{\Delta\bar{W}} = 2^{-p-d} Q = \dfrac{-\Delta E \; h_{k+1}^d}{w(p)\,(2^d - 1)\, 2^p}$.


The extrapolated ratio $\bar{Q}$ is used in testing for grid changes. This
may seem risky, since it depends on assuming (8.10). But in fact there is
no such risk, because we can see from (8.13) that testing with $\bar{Q}$ is
not that much different from testing with Q. (In fact, if p is constant,
testing with $\bar{Q}$ is equivalent to testing with Q against another
constant $\lambda$.)

The test with Q does not presume (8.10); it only assumes that the finer
($G^{k+1}$) approximation is considerably better than the coarser one, so
that their difference roughly corresponds to an added accuracy due to the
refinement. Note also that the multi-grid stopping criteria
((A.17) or (A.20) in App. A) are precisely such that Q can
be reliably computed from the final approximation.

B. Changing the discretization. The desired grid changes are first
just recorded (e.g., incidentally to the stage of computing r) and only
then they are simultaneously introduced, taking into account some
organizational and stabilizational considerations: A change (e.g.,
refinement) is introduced only if there is a point where the change is
"overdue" (e.g., a point where $Q > 10\lambda$). Together with such a point
the change is then also introduced at all neighbor (and neighbor of
neighbor, etc.) points where the change is "due" (e.g., where
$Q > 3\lambda$). The changed subgrid ($G^{k+2}$ in the above example) is
then augmented as follows: (i) Around each new grid point we add extra
points, if necessary, so that the grid point (corresponding to a $G^{k+1}$
point where a refinement was due) becomes an inner point (cf. Sec. 7.2) in
the new subgrid ($G^{k+2}$). (ii) Holes are filled; that is, if, on any grid
line, a couple of points are missing in between grid points, these missing
points are added to the grid.
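The "overdue"/"due" rule amounts to a flood fill seeded at the overdue points. The sketch below illustrates it on a two-dimensional array of Q values; the function name, the 4-neighbor connectivity and the list-of-lists layout are assumptions made here, and the augmentation steps (i) and (ii) are not included.

```python
from collections import deque


def points_to_refine(q, lam, overdue=10.0, due=3.0):
    """Return the set of (i, j) points where the change is introduced.

    q is a 2-D list of exchange rates Q on the coarser grid.  The change
    starts only from "overdue" points (Q > overdue*lam) and spreads to
    connected neighbors where it is "due" (Q > due*lam).
    """
    rows, cols = len(q), len(q[0])
    seeds = [(i, j) for i in range(rows) for j in range(cols)
             if q[i][j] > overdue * lam]
    marked = set(seeds)
    queue = deque(seeds)
    while queue:
        i, j = queue.popleft()
        for di, dj in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            ni, nj = i + di, j + dj
            if (0 <= ni < rows and 0 <= nj < cols
                    and (ni, nj) not in marked
                    and q[ni][nj] > due * lam):
                marked.add((ni, nj))
                queue.append((ni, nj))
    return marked
```

A "due" point that is not connected to any "overdue" point is left alone, which is what keeps isolated marginal points from triggering small, frequent grid changes.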


The control work in this system is negligible compared with, say,
the work of relaxing over $G^{k+1}$, because: (i) The tests are made in the
transition from $G^{k+1}$ to $G^k$, which takes place only once per several
$G^{k+1}$ relaxation sweeps. (ii) Q is computed and tested only at points of
the coarser grid $G^k$, and at each such point the work is smaller than the
relaxation work per point. (iii) Changing the discretization is itself
inexpensive since it is done by extending or contracting uniform grids (cf.
Sec. 7.1), the main work being in interpolating the approximate solution
to the new piece of uniform subgrid.


8.4 Generalizations. In some problems it is not enough to adapt h
and p. Sometimes different increments $h_1, h_2, \ldots, h_d$ should
be used at the d different directions, and each $h_j$ should be separately
adapted. Basically the same procedures as above can be used to test
and execute, for example, a change from $h_j$ to $h_j/2$. More generally,
one would like to adapt the local coordinates (cf. Sec. 7.4), e.g., near
discontinuities. Automatic procedures for such adaptation have not
been so far developed, but are conceivable.

Other discretization parameters, such as the centering of each
term in the difference operator, may be treated adaptively. (In fact,
such adaptive discretization is already in use in mixed-type problems,
where it was introduced by Murman to obtain stability. See, e.g., [9].)
In problems with unbounded domains, the discrete domain may be determined
adaptively (with increasingly coarser levels; cf. Sec. 7.1), using a
procedure that decides to extend the domain if the previous extension was
highly profitable in terms of $-\Delta E/\Delta W$. In many problems, some
terms in the difference operator can altogether be discarded on most levels
$G^k$. In particular, in singularly perturbed problems, the highest order
terms may be kept only on the finest-narrowest levels. Decision can
again be made in terms of $-\Delta E/\Delta W$, in an obvious way.


9. ADAPTIVE DISCRETIZATION: CASE STUDIES.

To get a transparent view of the discretization patterns and the
accuracy-work relations typical of the adaptive procedures proposed above,
we consider now several test cases which are simple enough to be analyzed
in closed form. That is, we consider problems with known solutions and
simple behavior of the local truncation errors, and we calculate the
discretization functions h(x) and p(x) that would be selected by the local
optimization equations (8.9), and the resulting relation between the error
estimator E and the computational work W.


9.1 Uniform-Scale Problems. A problem is said to have the uniform
scale $\eta(x)$ if the local truncation error (8.2) has the behavior

(9.1)    $\tau(x,h,p) \approx t(x) \left[ \dfrac{h}{\eta(x)} \right]^p$,    $(p_0 \le p \le p_1)$.

Such a behavior occurs, for example, when the solution is a trigonometric or
exponential function $\exp(\theta \cdot x)$, where $\theta$ is either a
constant or a slowly varying function (see example in Sec. 9.2). We will
also assume for simplicity that (see (8.4))

(9.2)    $w(p) = w_0 p^\ell$.

Usually $\ell = 1$, since the number of terms in the difference equations,
and hence also the amount of computer operations at each grid point, are
proportional to p. $\ell = 2$ is appropriate if we assume that we have to
increase the precision of our arithmetic when we increase p. Rescaling
W, we can assume that $w_0 = 1$.

Using (9.1-2) in equations (8.9) we get

(9.3a)    $G\tau = \lambda d\, p^{\ell-1} h^{-d}$,

(9.3b)    $G\tau \log\dfrac{h}{\eta} + \lambda \ell\, p^{\ell-1} h^{-d} = 0$.

Hence, denoting by $\bar{p}$ the value of p that satisfies

(9.4)    $\bar{p}^{\,\ell-1} e^{\ell\bar{p}/d} = \lambda^{-1} G\, t\, \eta^d e^{-\ell} d^{-1}$,

we have

(9.5a)    $h = \eta e^{-\ell/d}$,    $p = \bar{p}$,    if $p_0 \le \bar{p} \le p_1$;

(9.5b)    $h = (\lambda d\, p_0^{\ell-1} \eta^{p_0} t^{-1} G^{-1})^{1/(p_0+d)}$,    $p = p_0$,    if $\bar{p} \le p_0$;

(9.5c)    $h = (\lambda d\, p_1^{\ell-1} \eta^{p_1} t^{-1} G^{-1})^{1/(p_1+d)}$,    $p = p_1$,    if $p_1 \le \bar{p}$.

Notice that at any point either p or h, but never both, is "adaptive",
i.e., dependent of $\lambda$. Where p is adaptive
($p_0 < p = \bar{p} < p_1$), h is fixed and each "scale cube" $\eta^d$ is
divided into $e^\ell$ mesh cells.
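A minimal numerical sketch of the selection rule (9.5), restricted to the case $\ell = 1$, $w_0 = 1$, where (9.4) can be solved in closed form as $\bar{p} = d \log\left(G t \eta^d / (\lambda e d)\right)$. The function name and argument order are choices made here; G, t and $\eta$ are the local values at one point and are assumed positive.

```python
import math


def optimal_discretization(G, t, eta, lam, d=1, p0=2, p1=math.inf):
    """Return (h, p) at one point for w(p) = p, i.e. l = 1 and w_0 = 1."""
    # (9.4) with l = 1:  exp(p_bar/d) = G*t*eta**d / (lam*e*d)
    p_bar = d * math.log(G * t * eta**d / (lam * math.e * d))
    if p0 <= p_bar <= p1:
        # (9.5a): the order adapts, the mesh size is tied to the scale eta
        return eta * math.exp(-1.0 / d), p_bar
    # (9.5b) or (9.5c): the order is pinned at a bound, the mesh size adapts
    p = p0 if p_bar < p0 else p1
    h = (lam * d * eta**p / (t * G)) ** (1.0 / (p + d))
    return h, p
```

In either branch the returned pair satisfies (9.3a) with $\ell = 1$, that is $G\,t\,(h/\eta)^p = \lambda d\, h^{-d}$, which gives a quick consistency check on any implementation.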


Assume now further that the computer precision is unlimited (which is
never really the case, but may provide insight), so that $\ell = 1$ and
$p_1 = \infty$. If sufficiently high accuracy is desired, then $\lambda$ is
sufficiently small to have $\bar{p} \ge p_0$, so that (9.5a) applies. By
(8.1) and (9.3a) this implies

(9.6)    $E = \lambda d e \int \eta^{-d}\, dx$,

and hence, by (8.6),

(9.7)    $E = C_0\, e^{-W/(ed \int \eta^{-d} dx)} = C_0\, e^{-c_0 \bar{\eta}^d W}$,

where $\bar{\eta}$ is some average value of the scale $\eta(x)$. In this
(idealized) case, E decreases exponentially with W. For realistic W this
convergence rate becomes poor when $\bar{\eta}$ is very small, as in
singularly perturbed problems. In such problems, however, for realistic W
(9.5a) no longer applies, and another rate of convergence, independent of
$\bar{\eta}$, takes over (see Sec. 9.3).


9.2. One-Dimensional Case. Consider a 2-point boundary-value problem

(9.8)    $\eta \dfrac{d^2 U}{dx^2} + \dfrac{dU}{dx} = 0$    in $0 < x < 1$,

with constant $\eta > 0$ and with boundary conditions U(0) and U(1) such that
the solution is $U = e^{-x/\eta}$. An elliptic (stable) difference
approximation to such an equation can be central for $\eta \ge h$ but should
be properly directed for $\eta < h$. (The first order term being the main
term, the second order term should be differenced backward relative to it
with approximation order $p^* = p - [\log\eta/\log h]$. See [4] and Sec. 3.2
in [3].) In either case, the truncation error is approximately

(9.9)    $\tau(x,h,p) = t(x) \left( \dfrac{h}{\eta} \right)^p$,    where    $t(x) = \dfrac{1}{2\eta}\, e^{-2x/\eta}$.

We now choose the error weighting function to be

(9.10)    $G(x) = 1$,

which would be the choice (see (8.3)) when one is interested in accurate
computation of boundary first-order derivatives (corresponding, e.g., to
boundary pressure or drag, in some physical models). We again assume no
precision limitations, so that $\ell = 1$ and $p_1 = \infty$. We take
$p_0 = 2$ since second-order is no more expensive than first-order
approximations. Inserting these into (9.5) we get

(9.11a)    $h = \dfrac{\eta}{e}$,    $p = \log\dfrac{1}{2\lambda} - 1 - \dfrac{2x}{\eta}$,    for $0 \le x \le x_0$,

(9.11b)    $h = \dfrac{\eta}{e}\, e^{2(x-x_0)/(3\eta)}$,    $p = 2$,    for $x_0 < x \le 1$,

where

(9.11c)    $x_0 = \dfrac{\eta}{2} \left( \log\dfrac{1}{2\lambda} - 3 \right)$.

If $x_0 \ge 1$, then (9.11a) applies throughout, and hence

(9.12)    $W = \int_0^1 \dfrac{p}{h}\, dx = \dfrac{e}{\eta} \left( \log\dfrac{1}{2\lambda} - 1 - \dfrac{1}{\eta} \right)$,

(9.13)    $E = \int_0^1 \tau\, dx = \dfrac{\lambda e}{\eta} = \dfrac{1}{2\eta}\, e^{-\frac{1}{\eta} - \frac{\eta}{e} W}$,

and the condition $x_0 \ge 1$ itself becomes, by (9.11c,12),

(9.14)    $W \ge \left( 2 + \dfrac{1}{\eta} \right) \dfrac{e}{\eta}$.

Thus, if W satisfies (9.14), E converges like (9.13).


9.3. Singular Perturbation: Boundary Layer Resolution. When $\eta$ is
very small, problem (9.8) is singularly perturbed, and its solution has
a boundary layer near x=0. The above mesh-size $h = \eta/e$ is too small to
be practical. Indeed, in the optimal discretization (9.11), for small
$\eta$ we get small $x_0$, and an "external region" $x_0 \le x \le 1$ is
formed where the mesh size grows exponentially from $\eta/e$. The small mesh
size is used only to resolve the boundary layer. In this simplified problem
the solution away from the boundary layer (i.e., for $x \gg \eta$) is
practically constant, so that indefinitely large h is suitable. Usually h
will grow exponentially, as in (9.11b), from $h = \eta/e$ to some definite
value suitable for the external region. In the transition region we have
p=2, i.e., the minimal order of differencing is used in the region where h
changes. This may be useful in practical implementations.
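The mesh pattern described here can be generated by marching from x = 0. The sketch below assumes a reading of (9.11) in which $h = \eta/e$ for $x \le x_0$, $h = (\eta/e)\,e^{2(x-x_0)/(3\eta)}$ beyond it, and $x_0 = (\eta/2)(\log(1/(2\lambda)) - 3)$; these constants, the function names, and the cap `h_max` standing for the "definite value suitable for the external region" are all assumptions for illustration.

```python
import math


def mesh_size(x, eta, lam):
    """Local mesh size h(x) under the assumed form of (9.11)."""
    x0 = 0.5 * eta * (math.log(1.0 / (2.0 * lam)) - 3.0)
    if x <= x0:
        return eta / math.e                      # boundary-layer resolution
    # exponential growth in the transition region; clip to avoid overflow
    exponent = min(2.0 * (x - x0) / (3.0 * eta), 50.0)
    return (eta / math.e) * math.exp(exponent)


def graded_mesh(eta, lam, h_max=0.1):
    """March from x = 0 to x = 1, capping h at the external value h_max."""
    points = [0.0]
    while points[-1] < 1.0:
        h = min(mesh_size(points[-1], eta, lam), h_max)
        points.append(min(points[-1] + h, 1.0))
    return points
```

For example, `graded_mesh(0.01, 1e-8)` uses the fine spacing only inside the layer and far fewer points than a uniform mesh of size $\eta/e$ would need.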

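From (9.11) and (9.9) we get for small $\eta$

(9.15)    $W = \int_0^1 \dfrac{p}{h}\, dx \approx \dfrac{e}{4} \left( \log\dfrac{1}{2\lambda} \right)^2$,

(9.16)    $E = \int_0^1 \tau\, dx \approx \dfrac{\lambda e}{2} \log\dfrac{1}{2\lambda}$,

where the integrals are separately calculated in $(0, x_0)$ and $(x_0, 1)$.
Thus, E converges exponentially as a function of $\sqrt{W}$ instead of W,
but this rate is independent of $\eta$ and does not deteriorate as
$\eta \to 0$.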


9.4. Singular Perturbation without Boundary-Layer Resolution. To see
the effect of choosing different error weighting functions, consider
again the above problem (Secs. 9.2, 9.3), but with the choice G(x) = x.
This choice is typical of cases where one is not interested in calculating
boundary derivatives of the solution (see (8.3)). We then get

(9.17)    $\bar{p} = \log\dfrac{x}{2\lambda} - 1 - \dfrac{2x}{\eta} \le \log\dfrac{\eta}{4\lambda} - 2$.

Therefore, for small $\eta$ and reasonable $\lambda$, $\bar{p} < 0$ and p=2
for all x. Hence, no resolution of the boundary layer is formed. Indeed, by
(9.5b), for very small $\eta$ (singular-perturbation case)

(9.18)    $\left( \dfrac{h}{\eta} \right)^3 = \dfrac{2\lambda}{x}\, e^{2x/\eta} \ge \dfrac{4e\lambda}{\eta}$,

so that $h \gg \eta$. In practical situations where the solution in the
external region is not constant, the actual mesh-size will be determined by
the external regime.


9.5. Boundary Corners. Consider the two-dimensional Poisson equation
$\Delta U = F$ with smooth F and homogeneous boundary conditions, near a
boundary corner with angle $\pi/\alpha$, $\frac{1}{2} \le \alpha < 1$.
Denoting by r the distance from the corner, at small r the solution U is
$O(r^\alpha)$, and so is also the error weighting function G (if accuracy is
sought in the solution, but not in its derivatives, near the boundary).
Hence, $\tau = O(h^p r^{\alpha-p-2})$ and
$G\, \partial\tau/\partial h = O(h^{p-1} r^{2\alpha-p-2})$. If we fix the
order of approximation p, then the optimal mesh-spacing derived from (8.9a)
is

(9.19)    $h = O(\lambda^{1/(p+2)} r^{\beta})$,    $\beta = \dfrac{p+2-2\alpha}{p+2}$.

Hence, by (8.4) and (8.1) the total work and total error contribution from
a region of radius r around the corner are, respectively,

$W = \int\!\!\int \dfrac{w}{h^2}\, dx\, dy = O(\lambda^{-2/(p+2)} r^{2-2\beta})$,

$E = \int\!\!\int G \tau\, dx\, dy = O(\lambda^{p/(p+2)} r^{2-2\beta})$.

Hence the relation $E \sim W^{-p/d}$ (the usual relation in d-dimensional
smooth problem with p-th order approximation) still holds uniformly. The
corner does not "contaminate" the global convergence.

In the practical grid organization (Sec. 8.3) finer levels $G^k$ with
increasingly smaller mesh-sizes $h_k = 2^{-k} h_0$ will be introduced near
the corner. By (9.19), the level $G^k$ will extend from the corner to a
distance $r_k = C h_k^{1/\beta}$. Since $\beta < 1$, for small $h_k$ we get
$h_k > r_k$. This gives us in practice a natural stopping value for the
refinement process: The finest mesh-size near the corner is such that
$h_k \approx r_k/4$, so that level $G^k$ still has an inner point belonging
to $G^{k-1}$.

9.6. Singularities. Like boundary corners, all kinds of other problem
singularities, when treated adaptively, cause no degradation of the
convergence rate (of E as function of W).

Consider for example the differential equation LU=F where F is
smooth except for a jump discontinuity at x=0. Whatever the approximation
order p, the system will find $-\Delta E$ (see (8.11)) to be O(1) at all
points whose difference equations include values on both sides of the
discontinuity. At these points further refinements will, therefore, be
introduced as long as $-\Delta E/\Delta W > O(\lambda)$. Thus, around x=0,
some fixed number (depending only on p) of mesh points will be introduced at
each level $G^k$, until a mesh size $h = O(\lambda^{1/d})$ is reached. The
total amount of added work is therefore proportional to the number of levels
introduced, which is $O(\log h)$. The error contribution of the
discontinuity is $O(h^d)$, which is exponentially small in terms of the
added work.

This and similar analyses show that the adaptive scheme retains its
high-order convergence even when the problem is only piecewise smooth,
or has algebraic singularities, etc.


10.  HISTORICAL  NOTES  AND  ACKNOWLEDGEMENTS. 

Coarse-grid acceleration techniques were recommended and used by several
authors, including Southwell [24,13,14], Stiefel [15], Fedorenko [5], Ahamed
[19], Wachspress [17], de la Vallée Poussin [16] and Settari and Aziz [24].
Southwell called his technique "block" and more generally "group
relaxation", described it as "almost essential to practical success", and
gave heuristic explanation as well as practical implementation methods based
on variational considerations ("the aim being to reduce the total energy by
as great an amount as possible"). He also depicted procedures of "advance
to a finer net" [14]. Techniques of multiplicative coarse-grid corrections
(special cases of which appeared in [14], [19]) were developed by Wachspress
([17], Chapter 9), who called them "variational techniques". This work
motivated several studies, by Froelich, Wagner, Nakamura and Reed (see a
brief survey in [18]) and was applied in nuclear reactor design computations.

All these were two-level methods. The multi-grid idea was introduced
by Fedorenko [6], mainly for theoretical purposes. Namely, he rigorously
proved that $W(n,\epsilon)$, the number of operations required to reduce the
residuals of a Poisson problem on a rectangular grid with n points by a
factor $\epsilon$, is $O(n |\log\epsilon|)$. Bakhvalov [1] generalized this
result to any second-order elliptic operator with continuous coefficients.
For large n, this is the best possible result, except for the actual value
of the coefficient. The Fedorenko estimate can be written as

$W(n, .01) \le 210000\, n + W(10^6, .01)$,

and the Bakhvalov constants are still much larger. For admissible values
of n these estimates are therefore far worse than estimates obtained in
other methods, and they did not encourage any development of the method.
Fedorenko experimented with a two-level algorithm only, and seemed to imply
that for practical grid sizes ADI may be more efficient. He did not realize
the true practical potential, in both efficiency and programming
simplification, of a full, systematic multi-grid approach. (It can be proved
that $W(n, .01) \le 106\, n$, and in practice $W(n, .01) \approx 50\, n$ is
obtainable. See App. C.)

The first full multi-grid algorithms and numerical tests were described
in [2]. Our original approach was to regard the finer levels as "correcting"
the coarser level (cf. Secs. 1, 7.2 and 7.5 above). For uniform non-adaptive
grids this approach turns out to be equivalent to the one implied by [6],
but fundamentally it is different and more powerful, since the process is
not confined to a fixed discrete system.

A  systematic  multi-grid  approach  for  a  restricted  class  of  problems , 
with  somewhat  different  procedures  of  relaxation  and  transfer  to  coarser 
grids,  is  described  in  [21].  The  multi-grid  method  is  also  portrayed  in  [23]. 

Adaptive discretization procedures were introduced by several authors.
See for example [10], [20], [21] and references in [21]. The present approach
is different, not only in its multi-level setting, but also in its basic
criteria and procedures.


APPENDIX A. INTERPOLATIONS AND STOPPING CRITERIA: ANALYSIS AND RULES.

The multi-grid algorithms described above (Secs. 4 and 5) need to be
supplemented with some rules of interpolations and stopping criteria.
More specifically, for the interpolation $I_k^{k-1}$, transferring weighted
residuals from a fine grid $G^k$ to the next coarser grid $G^{k-1}$, we
should prescribe the weights, while for $I_{k-1}^k$, interpolating
corrections from $G^{k-1}$ back to $G^k$, the method and order of
interpolation should be prescribed. Stopping criteria should define
convergence at the various levels and detect slow convergence rates.
Numerical tests show that the parameters to be used are very robust: Full
efficiency of the multi-grid algorithm is obtained for stopping parameters
that do not depend on the geometry and the mesh size, and which may change
over a wide range (see, e.g., Appendix B), provided the correct forms of the
stopping criteria are used, and some basic rules of interpolation are
observed. To find the correct forms and rules, and to determine the stopping
parameters, we have to analyze the Coarse-Grid Correction (CGC) cycle, which
consists of interpolating ($I_k^{k-1}$) the residuals to the coarser grid
$G^{k-1}$, where the residual problem is solved, and then interpolating that
solution
back as a correction to the $G^k$ approximation.

We can use a local mode analysis (for the linearized, coefficient-frozen
difference equations), similar to the example in Sec. 3. Such
an analysis may be inaccurate for the lowest frequency modes, for which
the interaction with the boundary is significant. But these lowest modes
are of little significance in our considerations, since they are efficiently
approximated on the coarsest grids with little computational work, and
since care will be taken (i) to choose interpolation schemes that do not
convert small low-frequency errors into large high-frequency errors; and
(ii) to stop relaxation sweeps before low-frequency error components become
so large that they significantly feed the high frequencies (e.g., by
boundary and non-linear interactions). In fact, we will see that the
dominant components (i.e., the components that are slowest to converge in
the combined process of relaxation and coarse-grid corrections) are the
Fourier components $e^{i\theta\cdot x/h}$ for which $|\theta|$ is close to
$\rho\pi$, where (in a general d-dimensional problem)

(A.0)    $\theta = (\theta_1, \theta_2, \ldots, \theta_d)$,    $\theta\cdot x = \sum_{j=1}^{d} \theta_j x_j$,    $|\theta| = \max_j |\theta_j|$,    $h = h_k = \rho h_{k-1}$.

These components feed on each other in the interpolation processes between
$G^k$ and $G^{k-1}$, they are slower to converge by relaxation, and in the
CGC cycles they may even diverge.

To simplify the discussion we will assume that the mesh-size ratio
has its usual value $\rho = 1/2$, which is the only one to be used in
practice (cf. Sec. 6.2).


A.1. Coarse Grid Amplification Factors. For any given set of difference
operators and a multi-grid scheme, a local mode analysis of
the complete MG cycle can be made (cf. App. C), and the various parameters
can be optimized. The essential information can, however, be obtained
from a much simpler analysis that treats separately the two main processes,
relaxation sweeps and CGC cycles. The smoothing rate $\bar\mu$ (see Sec. 3)
is the main quantity describing the relaxation sweeps. The CGC local mode
analysis is summarized below (for algebraic details see Sec. 4.5 of [3]).

In the CGC analysis, together with each basic Fourier component
$e^{i\theta\cdot x/h}$ ($0 < |\theta| \le \pi/2$) we should treat all the
$G^k$ components that coincide with it on $G^{k-1}$, i.e., all components
$e^{i\theta'\cdot x/h}$ ($0 < |\theta'| \le \pi$) such that
$\theta'_j \equiv \theta_j \pmod{\pi}$ for j=1,2,...,d. We call such
component $\theta'$ a harmonic of $\theta$. We are especially interested in
those harmonics that are not separated from $\theta$ by the relaxation
sweeps, e.g., the set

$T_\theta = \{ \theta' \equiv \theta \pmod{\pi} : \mu(\theta') \ge \mu(\theta) \}$.

Denote by $|T_\theta|$ the number of members in this set. (Usually
$|T_\theta| = 2^\alpha$, where $\alpha$ is the number of coordinates j for
which $|\theta_j| = \pi/2$.) In terms of the $\theta$ Fourier component and
its harmonics, the CGC cycle has two effects:

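(i) Assuming the components not in $T_\theta$ to be comparatively small when
the CGC cycle is entered, the set of components in $T_\theta$ is transformed
in the cycle by a certain matrix, whose spectral radius turns out to be

(A.1)    $\sigma(\theta) = \sigma_0(\theta)$ if $|T_\theta| = 1$;    $\sigma(\theta) = \max(1, \sigma_0(\theta))$ if $|T_\theta| > 1$,

where

(A.2)    $\sigma_0(\theta) = \left| 1 - \sum_{\theta'} R(\theta,\theta')\, B_k(\theta')\, B_{k-1}(2\theta)^{-1} \rho(\theta') \right|$.

The functions $\rho(\theta')$, $R(\theta,\theta')$ and $B_\ell(\theta)$ are
the "symbols" of $I_k^{k-1}$, $I_{k-1}^k$ and $L^\ell$, respectively, i.e.,

$I_k^{k-1} e^{i\theta'\cdot x/h} = \rho(\theta')\, e^{i\theta'\cdot x/h}$    (cf. (A.10) below),

(A.3)    $I_{k-1}^k e^{i\theta\cdot x/h} = \sum_{\theta' \equiv \theta \,(\mathrm{mod}\ \pi)} R(\theta,\theta')\, e^{i\theta'\cdot x/h}$,

$L^\ell e^{i\theta\cdot x/h} = B_\ell(\theta)\, e^{i\theta\cdot x/h}$    ($\ell = k, k-1$).

(If L is a system of equations, and the right-hand side of (A.2) is
therefore a matrix, then $\sigma_0(\theta)$ is meant to be the spectral
radius of that matrix.) For small $|\theta|$ we have $|T_\theta| = 1$ and
hence

(A.4)    $\sigma(\theta) = \sigma_0(\theta) = 1 - \rho(0) + O(|\theta|^p + |\theta|^I)$,

where p is the approximation order of $L^k$ and $L^{k-1}$ (or the minimum of
the two) and I is the order of the $I_{k-1}^k$ interpolation (I=2 for linear
interpolation, etc.). The principal CGC amplification factor is

(A.5)    $\bar\sigma = \max_{0 < |\theta| \le \pi/2} \sigma(\theta) = \max(1, \bar\sigma_0)$,    where    $\bar\sigma_0 = \max_{0 < |\theta| \le \pi/2} \sigma_0(\theta)$.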




(ii) The CGC cycles also generate new secondary harmonics.
The rate of generating these, i.e., the ratio of the new $\theta'$ amplitude
to the old amplitude of the combined harmonics, turns out to be

(A.6)  =  |R{9,e”) 

=  0(1  , 


where  m  is  the  order  of  the  differential  equations. 

It follows from (A.4,6) that if $\rho(0) = 1$, as it is always chosen
to be (cf. Sec. A.4), and if $I \ge m$, then components with small
$|\theta|$ are very efficiently reduced in the multi-grid process.
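The notion of harmonics used in this analysis can be checked numerically in one dimension. The sketch below assumes the mesh-size ratio 1/2, so that the coarse-grid points are x = 2jh; the function names are chosen here for illustration. It lists the components $\theta' \equiv \theta \pmod{\pi}$ with $0 < |\theta'| \le \pi$ and verifies that they are indistinguishable from $\theta$ at the coarse points.

```python
import cmath
import math


def harmonics(theta):
    """1-D components theta' = theta (mod pi) with 0 < |theta'| <= pi."""
    candidates = (theta - math.pi, theta, theta + math.pi)
    return [t for t in candidates if 0 < abs(t) <= math.pi]


def coincide_on_coarse_grid(theta, theta_prime, n_points=8):
    """True if exp(i*theta*x/h) and exp(i*theta_prime*x/h) agree at x = 2*j*h."""
    return all(
        abs(cmath.exp(1j * theta * 2 * j)
            - cmath.exp(1j * theta_prime * 2 * j)) < 1e-9
        for j in range(n_points)
    )
```

For instance, `harmonics(0.3)` returns the low-frequency component 0.3 together with its high-frequency partner 0.3 - pi, and the two coincide on the coarse grid, which is why the coarse-grid correction cannot tell them apart.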


A.2. The Coarse-to-Fine Interpolation. On the other hand, it
follows from (A.6) that if $I < m$ then even a small and smooth residual
function may produce large high-frequency residuals, and a significant
amount of computational work will be required to smooth them out. This
effect was clearly shown in numerical experiments ([2], [11]). Hence we have

The Basic Rule: The order of interpolation should be no less than the
order of the differential equations ($I \ge m$). In particular, polynomial
interpolation should be made with polynomials of degree $\ge m-1$.

Higher interpolation orders ($I > m$) are desired in the initial stages
of solving a problem, when the residuals are (locally) smooth. For
instance, in regions where the given problem has smoothness of order q
(i.e., $F(x) = \sum A_\theta e^{i\theta\cdot x/h}$,
$A_\theta = O(|\theta|^{-q} h^q)$), in order to ensure that the
high-frequency residuals remain $O(h^q)$, at the i-th interpolation from
$G^{k-1}$ to $G^k$ the order should be

(A.7)    $I \ge m + \max[q - (i-1)p,\ 0]$.

(In fact, as long as $q > ip$, this interpolation need not be followed
by relaxation sweeps, since the low-frequency amplitudes are still
dominant. Relaxation would only feed from these low components to high
frequency ones, causing additional work later. Still better, however,
instead of this multi-grid mode without intermediate $G^k$ relaxation,
is to make a higher-order correction on $G^{k-1}$.)
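Rule (A.7), read as $I \ge m + \max[q - (i-1)p,\ 0]$, is simple enough to tabulate. The helper below is a small illustration; its name and argument order are chosen here and are not part of the text.

```python
def required_interpolation_order(m, q, i, p):
    """Minimal order I of the i-th coarse-to-fine interpolation.

    m: order of the differential equations
    q: smoothness order of the given problem
    i: index of the interpolation (1 for the first one)
    p: approximation order of the difference operators
    """
    return m + max(q - (i - 1) * p, 0)
```

With m = 2, q = 6 and p = 2 the required order drops from 8 at the first interpolation to the minimal value m = 2 from the fourth interpolation on, in line with the remark that high orders only pay off in the initial stages.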

Eventually, however, the smoothness of F (which is the original residual
function) is completely lost in subsequent cycles, and the convergence of
components in the dominant range ($|\theta| \approx \pi/2$) becomes our main
concern. For these components, higher interpolation orders ($I > m$) are no
more effective than the minimal order ($I = m$). This again was seen
in numerical experiments ([2], [10]), which confirmed that the multi-grid
efficiency is not improved (except in the first few cycles) by using
$I > m$.

A method to implement high-order interpolations in case of
equations of the form $\Delta U = F$ is to base the interpolation on
suitably-rotated difference approximations. See [14, p. 53] and [7].

A.3. The Effective Smoothing Rate. The smoothing rate $\bar\mu$ was defined
in (3.8) as the slowest convergence rate for all components not represented
at the coarser level. More relevant, however, is the slowest rate among
all components for which the coarse-grid correction is not effective,
namely,

(A.8)    $\hat\mu = \max \{ \mu(\theta) : \pi/2 \le |\theta| \le \pi$ or $\sigma_0(\theta) \ge 1 \}$,

which we call the "effective smoothing rate". It is clear, on one hand,
that no rate faster than $\hat\mu$ can be generally obtained as rate of
convergence per $G^k$ relaxation sweep, no matter how well and how often the
$G^{k-1}$ problem is solved. On the other hand, the rate $\hat\mu$ can
actually be attained (or approached) by correctly balancing the number of
relaxation sweeps in between CGC cycles (see Sec. A.6). In most cases (all
cases examined by us) one can make $\sigma_0(\theta) < 1$ for all
$|\theta| < \pi/2$ by proper choice of $I_k^{k-1}$ (see Sec. A.4), and it is
therefore justifiable to use $\bar\mu$ as the effective rate when relaxation
schemes are studied by themselves.


A.4. The Fine-to-Coarse Weighting of Residuals ($I_k^{k-1}$), and the Coarse-Grid Operator $L^{k-1}$. The transfer of the $G^k$ residuals $r^k = F^k - L^k u^k$ to the coarser grid $G^{k-1}$, to serve there as the right-hand side $f^{k-1}$ (see Sec. 4, Step e), can be made in many ways. Generally $f^{k-1}$ is defined as some weighted average of the residuals in neighboring $G^k$ points:

(A.9)  $f^{k-1}(x) = I_k^{k-1} r^k(x) = \sum_\nu \rho_\nu\, r^k(x + \nu h_k)$,

where $\nu = (\nu_1, \ldots, \nu_d)$, the $\nu_j$ are integers, and the summation is over a small set. In terms of these weights, $\rho(\theta)$ in (A.2) is given by

(A.10)  $\rho(\theta) = \sum_\nu \rho_\nu\, e^{i\theta\cdot\nu}$.

The coarse-grid operator $L^{k-1}$ can also be chosen in many ways, e.g., as some weighted average of the operator $L^k$ in neighboring points.
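The symbol of a set of weights can be evaluated directly. The sketch below assumes the two-dimensional case and the sign convention $\rho(\theta) = \sum_\nu \rho_\nu e^{i\theta\cdot\nu}$, which follows from applying the weighted average to a mode $e^{i\theta\cdot x/h}$. The dictionary layout and the names are illustrative assumptions; the five-point set uses center weight 1/2 and four neighbor weights 1/8, so that the weights sum to 1.

```python
import cmath
import math


def weight_symbol(weights, theta):
    """rho(theta) = sum over nu of rho_nu * exp(i * theta . nu), 2-D case.

    weights: dict mapping integer offsets (nu1, nu2) to rho_nu.
    theta:   pair (theta1, theta2) in radians.
    """
    t1, t2 = theta
    return sum(w * cmath.exp(1j * (t1 * n1 + t2 * n2))
               for (n1, n2), w in weights.items())


# Illustrative weight sets (names are assumptions of this sketch).
INJECTION = {(0, 0): 1.0}
FIVE_POINT = {(0, 0): 0.5, (0, 1): 0.125, (0, -1): 0.125,
              (1, 0): 0.125, (-1, 0): 0.125}

print(abs(weight_symbol(FIVE_POINT, (0.0, 0.0))))
print(abs(weight_symbol(FIVE_POINT, (math.pi, math.pi))))
```

Injection has symbol 1 for every $\theta$, whereas the five-point set damps the highest-frequency mode $(\pi, \pi)$ completely.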

How are these choices to be made? The main purpose should be to minimize $\bar\sigma$, but without investing too much computational work in the weighting. Usually, it is preferable to adjust $\rho$ and not $L^{k-1}$, because this provides enough control on $\sigma$ (cf. (A.2)) and because complicating $L^{k-1}$ adds many more computations and gets increasingly complicated as one advances to still coarser levels. For the programmer, using the same operators at all levels is an important simplification (cf. App. B), especially for nonlinear problems.

It is clear from (A.4) that we should take $\rho(0) = \sum_\nu \rho_\nu = 1$. There is no a priori restriction, however, on the signs of the weights $\rho_\nu$. The trivial weighting

(A.11)  $\rho_0 = 1$, $\rho_\nu = 0$ for $\nu \ne 0$,

called injection, has an important advantage in saving computations, not only because the weighting itself is saved, but mainly because it requires the computation of $r^k$ only at the $G^{k-1}$ points, while other weighting schemes compute $r^k$ at all $G^k$ points, an additional work comparable to one $G^k$ relaxation sweep.
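The difference between injection and a non-trivial weighting can be seen on a small array. The following Python sketch applies a weighted residual transfer at the interior coarse points of a two-dimensional grid, taking coarse point (I, J) to coincide with fine point (2I, 2J). The array layout, the names and the sample weights are assumptions of this sketch, and the mesh-size scaling used in the Appendix B program is left out.

```python
def restrict(residual, weights):
    """Weighted transfer of fine-grid residuals to interior coarse points.

    residual: list of rows on the fine grid, with odd side lengths.
    weights:  dict mapping integer offsets (d1, d2) to rho.
    """
    nc_i = (len(residual) + 1) // 2
    nc_j = (len(residual[0]) + 1) // 2
    coarse = [[0.0] * nc_j for _ in range(nc_i)]
    for ic in range(1, nc_i - 1):
        for jc in range(1, nc_j - 1):
            coarse[ic][jc] = sum(w * residual[2 * ic + di][2 * jc + dj]
                                 for (di, dj), w in weights.items())
    return coarse


INJECTION = {(0, 0): 1.0}
FIVE_POINT = {(0, 0): 0.5, (0, 1): 0.125, (0, -1): 0.125,
              (1, 0): 0.125, (-1, 0): 0.125}

# A smooth (constant) residual and the most oscillatory one on a 5 x 5 grid.
constant = [[1.0] * 5 for _ in range(5)]
checkerboard = [[(-1.0) ** (i + j) for j in range(5)] for i in range(5)]

print(restrict(checkerboard, INJECTION)[1][1])
print(restrict(checkerboard, FIVE_POINT)[1][1])
```

Injection only reads the fine-grid residual at the coarse points, which is the saving described above; the price is that an oscillatory residual is passed to the coarse grid unchanged, while the five-point weights average it out.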

Examples. For symmetric second-order equations, injection should usually be used. For the 5-point Laplace operator, for example, if we take $I_k^{k-1}$ to be injection, $I_{k-1}^k$ linear interpolation and $L^{k-1}$ also a 5-point Laplace operator, we get $\bar\sigma = 1$, the minimal possible value. Any weighting is a pure waste, including the "optimal" weighting

(A.12)  $\rho_{00} = \tfrac12$, $\rho_{01} = \rho_{0,-1} = \rho_{10} = \rho_{-1,0} = \tfrac18$, $\rho_{\alpha\beta} = 0$ for $|\alpha| + |\beta| > 1$,

which minimizes [symbol illegible] but does not lower $\bar\sigma$. Numerical tests (modifying the program of Appendix B) indeed showed no improvement by weighting. If, however, the equation has strong variation, making $B_{k-1}$ quite different from $B_k$, we may get for injection $\bar\sigma > 1$, while weighting (A.12) will keep [symbol illegible] safely below 1, giving $\bar\sigma = 1$.

For higher-order equations, non-trivial weighting offers an important advantage. If, for example, $L^k$ and $L^{k-1}$ are 13-point biharmonic operators and $I_{k-1}^k$ is cubic interpolation, then $\bar\sigma = 3$ for injection, while $\bar\sigma = 1$ for the weighting

$\rho_{01} = \rho_{0,-1} = \rho_{10} = \rho_{-1,0} = \tfrac14$, $\rho_{\alpha\beta} = 0$ for $|\alpha| + |\beta| \ne 1$.


A.5. Finite-Element Procedures. The main difference between finite-element and finite-difference multi-grid procedures is in the interpolation schemes. In the finite-element case, interpolation procedures follow automatically from the variational formulation and the definition of the approximation spaces $S^k$ (corresponding to the levels $G^k$). Usually, $S^{k-1}$ is a subspace of $S^k$. The coarse-to-fine interpolation is, therefore, simply the identity operation. Also, if the variational problem in $S^k$ is to minimize $A^k(V^k)$, then, for any given approximation $v^k$, the correction problem in the coarser space $S^{k-1}$ is, simply, to minimize

(A.13)  $A^{k-1}(V^{k-1}) = A^k(v^k + V^{k-1})$.

Example. Consider the standard example, where $S^k$ is the space of piecewise linear functions on the triangulation and $A^k$ is a Dirichlet integral whose minimization is equivalent to the difference equation $\Delta^k V^k = F^k$, $\Delta^k$ being the 5-point Laplacian. Computing $A^{k-1}$ by (A.13), it turns out to be equivalent to the equation

$\Delta^{k-1} V^{k-1} = I_k^{k-1}(F^k - \Delta^k v^k)$,

where $I_k^{k-1}$ has the weights (cf. Sec. A.4)

$\rho_{00}$ = [value illegible], $\rho_{01} = \rho_{11} = \rho_{10} = \rho_{0,-1} = \rho_{-1,-1} = \rho_{-1,0}$ = [value illegible].

These weights give the same multi-grid convergence rate as injection (and are, therefore, redundant).


A.6. Criteria for Slow Convergence Rates

(A) Relaxation sweeps, say on $G^k$, should be discontinued, and a switch should be made to a coarse-grid correction, when the rate of convergence becomes slow; e.g., when

(A.14)  (residual norm) / (residual norm a sweep earlier) $> \eta \equiv (\bar\sigma + 3\bar\mu)/(\bar\sigma + 3)$.

The norm here is a suitable (e.g., $L_2$, or (A.18)) discrete measure, usually of the "dynamic" residuals, that is, residuals computed incidentally to the relaxation process. $\bar\mu$ and $\bar\sigma$ are defined in (A.8) and (A.5), respectively. Usually, one can choose the $I_k^{k-1}$ weighting so that $\bar\sigma = 1$, in which case $\eta = (1 + 3\bar\mu)/4$. In any case, (A.14) is designed to ensure that, on one hand, the CGC cycle is delayed enough to make its $\bar\sigma$ magnification small compared with the intermediate reduction by relaxation sweeps. On the other hand, for $\theta$ with $\mu(\theta)$ considerably slower than $\bar\mu$, the CGC cycles are still sufficiently frequent to compensate for the slower $\mu$, since their reduction rate $\sigma(\theta)$ decreases rapidly ((A.4) with $\rho(0) = 1$). The stopping rule (A.14) also prevents low error frequencies from dominating relaxation, thus avoiding significant feeding from low to high frequencies (through boundary and nonlinear interactions).

If the "stopping rate" $\eta$ varies over the domain of computations (as a result of variations in L, in case of nonlinear or non-constant-coefficient problems), the largest $\eta$ should be chosen for the stopping criterion (A.14). If $\log \eta$ changes too much over the domain (which should not happen when a proper relaxation scheme is used), then (A.14) must be checked separately in subdomains, and partial sweeping (see Sec. A.9) might be used.

An appropriate value of $\eta$ may also easily be found by direct trial and error. Such a value is typical of the (locally linearized, coefficient-frozen) problem, is independent of either h, $\Omega$ or F, and may therefore be found, once and for all, on a moderately coarse grid. In some nonlinear problems the value may need some adjustment as the computations proceed. Whenever the coarse-grid corrections seem to be ineffective, $\eta$ should be increased, e.g., to $(1 + 3\eta)/4$. Generally, the overall multi-grid convergence rate is not very sensitive to increasing $\eta$: at worst, the rate may become $\eta$ instead of the theoretically best rate $\max \bar\mu$ (cf. Sec. 6.2).

For the Poisson equation with Gauss-Seidel relaxation, for example, we have $\bar\sigma = 1$, $\mu = \bar\mu = .5$, hence $\eta = .625$. The example in Appendix B shows that the optimal MG convergence rate $\approx .595$ is indeed attained. Experimenting with this program gave similar results for any smaller $\eta$ (the reason being that the minimal number of two sweeps at each level is good enough in this problem), while for any $\eta \le .95$ the total amount of computational work was no more than twice the work at $\eta = .62$.
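The switching rule can be sketched in a few lines of Python. The sketch takes the stopping rate to be $\eta = (\bar\sigma + 3\bar\mu)/(\bar\sigma + 3)$, which reproduces $\eta = .625$ for $\bar\sigma = 1$, $\bar\mu = .5$; the function names are hypothetical, and the residual norms in the usage lines are taken from the sample output of Appendix B.

```python
def stopping_rate(sigma_bar, mu_bar):
    """Stopping rate eta of criterion (A.14), as read in this sketch."""
    return (sigma_bar + 3.0 * mu_bar) / (sigma_bar + 3.0)


def switch_to_coarse_grid(err, err_previous, eta):
    """True when the last sweep reduced the residual norm too slowly."""
    return err / err_previous > eta


eta = stopping_rate(1.0, 0.5)
print(eta)
# Two consecutive sweeps on the finest level of the sample run:
print(switch_to_coarse_grid(2.442, 10.64, 0.6))   # fast reduction, keep relaxing
print(switch_to_coarse_grid(2.399, 2.442, 0.6))   # stalled, go to the coarser grid
```

The value 0.6 in the usage lines is the stopping rate coded in the sample program rather than the value 0.625 computed from the formula.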

(B) Another way to decide upon discontinuation of relaxation is to directly measure the smoothness of the residuals. The switch to coarser grids can be made, for instance, when differences between residuals at neighboring points are small compared with the residuals themselves.
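One possible way to quantify criterion (B) is sketched below in Python: the $L_2$ norm of the differences between neighboring residuals is divided by the $L_2$ norm of the residuals themselves. The particular measure, the function names and the threshold value are assumptions of this sketch and are not prescribed by the text.

```python
import math


def roughness(res):
    """L2 norm of neighbor differences divided by the L2 norm of the residuals."""
    rows, cols = len(res), len(res[0])
    diff_sq = 0.0
    for i in range(rows):
        for j in range(cols):
            if i + 1 < rows:
                diff_sq += (res[i + 1][j] - res[i][j]) ** 2
            if j + 1 < cols:
                diff_sq += (res[i][j + 1] - res[i][j]) ** 2
    total_sq = sum(r * r for row in res for r in row)
    return math.sqrt(diff_sq / total_sq) if total_sq > 0.0 else 0.0


def smooth_enough(res, threshold=0.5):
    """Hypothetical switch test; the threshold is a tuning value, not from the text."""
    return roughness(res) < threshold


smooth = [[1.0] * 4 for _ in range(4)]
rough = [[(-1.0) ** (i + j) for j in range(4)] for i in range(4)]
print(roughness(smooth), roughness(rough))
```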

A.7. Convergence Criteria on Coarser Grids. In the CGC mode analysis above it was assumed that the problem on the coarser grid $G^{k-1}$ was fully solved and then interpolated as a correction to the $G^k$ approximation. In the actual multi-grid algorithm (Sec. 4) we solve the $G^{k-1}$ problem iteratively, stopping the iterations when some convergence criterion is met. This criterion should roughly detect the situation at which more improvement (per unit work) is obtained by relaxing on the $G^k$ grid (after interpolating) than by further iterating the $G^{k-1}$ problem (before interpolating). A crude mode analysis (similar to Sec. 4.6.2 in [3]) shows that such a criterion is

(A.15)  $\|r^{k-1}\| \le \delta\, \|r^k\|$,  $\delta$ = [formula illegible; it involves $\bar\sigma$, $\mu_k$, $\mu_{k-1}$, $2^{-d}$ and s],

where d is the dimension, $\bar\sigma$ is given by (A.5), s is a maximum of an expression in $R(\theta,\theta')$, $B_k(\theta')$, $B_{k-1}(2\theta)$ and $\rho(\theta')$ [formula illegible], and $\mu_k = \bar\mu$ on the $G^k$ grid (cf. (A.8)). $\|r^{k-1}\|$ is any norm of the current residuals in the $G^{k-1}$ problem, while $\|r^k\|$ is the corresponding norm in the $G^k$ problem. It is important that these norms are comparable. They should be discrete approximations to the same continuum norms. Also, if $r^{k-1}$ are the "dynamic" residuals (i.e., computed incidentally to the last $G^{k-1}$ relaxation sweep, using the latest available values of the relaxed solution), then $r^k$ should be the $G^k$ dynamic residuals, unlike the residuals transferred to $G^{k-1}$ (to define $f^{k-1}$; cf. Sec. A.4), which must be "static" residuals (i.e., computed over the grid without changing the solution at the same time). If, however, $r^k$ and $r^{k-1}$ are static and dynamic, respectively, the parameter $\delta$ in (A.15) should be multiplied by a certain factor B (see Sec. 4.6.2 in [3]).

The stopping criterion (A.15) is based on the assumption that error components with $|\theta| \approx \pi/2$ dominate the process. In the first [q/p] CGC cycles, however, lower components are dominant, and the main consideration is to converge them. Hence, at that initial stage, the $G^{k-1}$ convergence criterion should be

(A.16)  [inequality illegible; it involves the $G^{k-1}$ residuals and $\tau^{k-1}$],

where $\tau^{k-1}$ are the $G^{k-1}$ truncation errors (cf. Sec. A.8).


The key factor $\delta$ can also be found by trial and error. Like $\eta$ above, it is essentially independent of h, $\Omega$ and F, and may, therefore, be found once and for all by tests on moderately coarse grids. Numerical experiments show that the overall multi-grid efficiency is not very sensitive to very large variations in $\delta$ and, in particular, $\delta$ may be lowered by orders of magnitude without large changes in the efficiency. For example:

For the 5-point Poisson equation with Gauss-Seidel relaxation, injection and linear interpolation, (A.15) yields $\delta = .219$. Numerical experiments (e.g., with the program in Appendix B) show that with any $.001 \le \delta \le .5$ the computational work is no more than 25% above the work with $\delta = .22$, and no more than 100% extra work for any $.0001 < \delta < .7$.
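The requirement that the two norms be comparable can be made concrete with a short Python sketch. It assumes a two-dimensional grid of n x n sample points with mesh size h = 1/n, so that $\sqrt{h^2 \sum r^2}$ approximates the same continuum $L_2$ norm on every level, and it takes the coarse-grid test in the form $\|r^{k-1}\| \le \delta \|r^k\|$. The function names and the default value of delta are illustrative assumptions.

```python
import math


def l2_norm(res, h):
    """Discrete L2 norm sqrt(h**2 * sum r**2) of a 2-D residual array."""
    return math.sqrt(h * h * sum(r * r for row in res for r in row))


def coarse_problem_converged(norm_coarse, norm_fine, delta=0.219):
    """Coarse-grid stopping test ||r_coarse|| <= delta * ||r_fine||."""
    return norm_coarse <= delta * norm_fine


# The same constant residual sampled on a fine and on a coarse grid
# gives the same norm, so the two levels can be compared directly.
fine = [[1.0] * 8 for _ in range(8)]
coarse = [[1.0] * 4 for _ in range(4)]
print(l2_norm(fine, 1.0 / 8), l2_norm(coarse, 1.0 / 4))
```

Without the factor $h^2$ the fine-grid sum would be four times the coarse-grid one for the same function, and the test would be biased toward stopping too early or too late.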


A.8. Convergence on the Finest Grid. On the finest grid $G^M$ the solution is usually considered converged when the (static) residuals are of the order of the truncation errors, in some appropriate norm. One way to estimate the truncation errors is to measure them on coarser grids by (5.7), and extrapolate (taking into account that they are $O(h^p)$). Another, related but more straightforward criterion is to detect when the $G^M$ solution has contributed most of its correction to the $G^{M-1}$ solution. In the FAS algorithm the natural place to check is when a new $\bar F^{M-1}$ is computed, the convergence test being


(A.17)  $\|\bar F^{M-1} - \bar F^{M-1}_{\text{previous}}\| \ll \|\bar F^{M-1} - I_M^{M-1} F^M\|$.


The norm here may be any ($L_2$, $L_\infty$, etc.), but the most relevant one is the discrete version of the norm (cf. Sec. 8.1)

(A.18)  $\|f\| = \int G(x)\,|f(x)|\,dx$.

A.9. Partial Relaxation Sweeps. A partial relaxation sweep over $G^k$ is a relaxation sweep that may skip some subdomains of $G^k$. (Unlike "selective" relaxation sweeps, which in principle pass through all the grid points, although corrections may not be introduced in some of them. Cf. Sec. 3.2. A partial sweep may be selective, too.)

Partial sweeps are not used much in standard relaxation calculations. Usually, a slow-to-converge subdomain is coupled to other subdomains and therefore cannot be relaxed separately. In the multi-grid process, however, only high-frequency error components are to be reduced by relaxation, and this can be done separately in subdomains: with regard to high frequencies, subdomains are practically decoupled. Hence, in the multi-grid process, partial sweeps are potentially very important. In fact, high-frequency amplitudes may vary greatly over the domain, especially if $\bar\mu$ and $\bar\sigma$ vary much, or if high-frequency error components are introduced at boundaries, making partial sweeping there very desirable.

Partial sweeping may be performed by applying a criterion for slow convergence (Sec. A.6) separately in subdomains. (If the connected region of partial relaxation is small, $\eta$ in (A.14) should be changed to $(\bar\sigma + 3\nu)/(\bar\sigma + 3)$, where $\nu$ is the largest amplification factor for Fourier components on the relaxed region.) A subdomain may be excluded from subsequent relaxation sweeps if slow convergence is shown simultaneously on that subdomain and on all neighboring subdomains. Underrelaxation may be used to phase out the relaxed region (cf. [3], Sec. 4.6.4). The subdomains may be chosen quite arbitrarily, but each of them should be large enough (at least 4x4) to allow for separate smoothing.


A.10. Convergence Criteria on Non-uniform Grids

When $G^k$ and $G^{k-1}$ are not coextensive (i.e., the domain covered by $G^k$ is only part of the $G^{k-1}$ domain; cf. Sec. 7.2), the convergence criteria (Secs. A.7-8) should be slightly modified. First, in (A.15), $\|r^k\|$ is not a comparable norm, since it may be measured on a much narrower subdomain. Instead, one can use the


test

(A.19)  $\|r^{k-1}\| < \delta\, \|\tilde r^{k-1}\| / \eta$,

where $\|\tilde r^{k-1}\|$ is the residual norm computed on $G^{k-1}$ at the first relaxation sweep after switching from $G^k$. The division by $\eta$ in (A.19) is designed to compensate for the fact that $\|\tilde r^{k-1}\|$ is computed a sweep later than $\|r^k\|$.

The other modification is in (A.17), where it was assumed that $G^M$ is the finest level everywhere. Generally, the convergence test can be, for example,

(A.20)  $\|\bar F^k - \bar F^k_{\text{previous}}\| \ll$ [right-hand side illegible; it carries the index k+1]  for all k = 0, 1, ..., M-1,

where the norms are taken over $G^{k+1}$ (or, more precisely, over [region illegible; it carries the index k+2]).

APPENDIX B: SAMPLE MULTI-GRID PROGRAM AND OUTPUT

This simple program of Cycle C (written in 1974 by the author at the Weizmann Institute) illustrates multi-grid programming techniques and exhibits the typical behavior of the solution process. For a full description of Cycle C, see Sec. 4 or the flowchart in Fig. 1.

The program solves a Dirichlet problem for the Poisson equation on a rectangle. The same 5-point operator is used on all grids. The residual transfer is the trivial one (injection), and the $I_{k-1}^k$ interpolation is linear. The higher interpolation (A.7) and the special stopping criterion (A.16), recommended for the first [q/p] cycles, are not implemented.


The program works with the arrays $v^k$ and $f^k$ (k = 1, 2, ..., M). For handling these arrays, $f^k$ is also called $v^{k+M}$. The coarsest grid has NX0 x NY0 intervals of length H0 each. Subsequent grids are defined as straight refinements, with mesh sizes H(k) = H0/2**(k-1). The function F(x,y) is the right-hand side of the Poisson equation. The function G(x,y) serves both as the Dirichlet boundary condition ($\Phi$) and as the first approximation ($u^M$). The program cycles until the $L_2$ norm of the residuals on $G^M$ is reduced below TOL, unless WORK exceeds WMAX. After each relaxation sweep on any grid $G^k$, a line is printed out showing the level k, the $L_2$ norm of the ("dynamic") residuals computed in the course of this relaxation, and WORK, which is the accumulated relaxation work (where a sweep on the finest grid is taken as the work unit).


Note the key role of the GRDFN and KEY subroutines. The first is used to define a grid ($v^k$), i.e., to allocate space for it in the general vector Q (where IQ points to the next available location), and to store its parameters. To use grid $v^k$, CALL KEY(k,IST,M,N,H) retrieves the grid parameters (dimension MxN and mesh-size H) and sets the array IST(i) so that $v^k(i,j)$ = Q(IST(i)+j). This makes it easy to write one routine for all grids $v^k$; see, for example, Subroutine PUTZ(k). It also makes it easy to write the same routines (RELAX, INTADD, RESCAL) for all levels.
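The storage scheme behind GRDFN and KEY can be mirrored in a few lines of Python. This is only an illustrative sketch with 0-based indices: the class and function names are hypothetical, and only the solution arrays are allocated here, whereas the FORTRAN program also stores the right-hand sides as grids k+M.

```python
class GridStore:
    """All grids live in one flat list q; each grid records start and shape."""

    def __init__(self):
        self.q = []
        self.params = {}              # k -> (start, imax, jmax, h)

    def define(self, k, imax, jmax, h):
        """Role of GRDFN: reserve imax*jmax entries of q for grid k."""
        self.params[k] = (len(self.q), imax, jmax, h)
        self.q.extend([0.0] * (imax * jmax))

    def key(self, k):
        """Role of KEY: row offsets ist such that v_k(i, j) = q[ist[i] + j]."""
        start, imax, jmax, h = self.params[k]
        ist = [start + i * jmax for i in range(imax)]
        return ist, imax, jmax, h


def build_hierarchy(nx0, ny0, h0, m):
    """Grids k = 1..m, each a straight refinement of the previous one."""
    store = GridStore()
    for k in range(1, m + 1):
        k2 = 2 ** (k - 1)
        store.define(k, nx0 * k2 + 1, ny0 * k2 + 1, h0 / k2)
    return store


def put_zero(store, k):
    """One routine serving every level, in the spirit of PUTZ."""
    ist, imax, jmax, _ = store.key(k)
    for i in range(imax):
        for j in range(jmax):
            store.q[ist[i] + j] = 0.0


store = build_hierarchy(3, 2, 1.0, 6)
print(store.key(6)[1:])
```

Because every routine obtains its row offsets from `key`, the same loop body works on any level, which is the simplification the paragraph above points out.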


To solve problems other than Poisson on the same domain, the only subroutines to be changed are the relaxation routine RELAX and the residual injection routine RESCAL, the latter being just a slight variation of the first.

For different domains, more general GRDFN and KEY subroutines should be written. A general GRDFN subroutine, in which the domain characteristic function is one of the parameters, has been developed, together with the corresponding KEY routine. This essentially reduces the programming of any multi-grid solution to programming a usual relaxation routine.

      PROGRAM CYCLEC
C     CYCLE C
      EXTERNAL G,F
      CALL MULTIG(3,2,1.,6,.01,30.,G,F)
      STOP
      END
      FUNCTION F(X,Y)
C     Right-hand side of the equation
      F=SIN(3.*(X+Y))
      RETURN
      END
      FUNCTION G(X,Y)
C     Boundary values and first approximation
      G=COS(2.*(X+Y))
      RETURN
      END

      SUBROUTINE MULTIG(NX0,NY0,H0,M,TOL,WMAX,U1,F)
C     Multi-grid algorithm (see Fig. 1)
C     U1 and F are passed on to PUTF, hence the EXTERNAL declaration
      EXTERNAL U1,F
      DIMENSION EPS(10)
      DO 1 K=1,M
      K2=2**(K-1)
      CALL GRDFN(K,NX0*K2+1,NY0*K2+1,H0/K2)
    1 CALL GRDFN(K+M,NX0*K2+1,NY0*K2+1,H0/K2)
      EPS(M)=TOL
      K=M
      WU=0
      CALL PUTF(M,U1,0)
      CALL PUTF(2*M,F,2)
    5 ERR=1.E30
    3 ERRP=ERR
      CALL RELAX(K,K+M,ERR)
      WU=WU+4.**(K-M)
      WRITE(6,4)K,ERR,WU
    4 FORMAT(' LEVEL',I2,' RESIDUAL NORM=',1PE10.3,' WORK=',0PF7.3)
      IF(ERR.LT.EPS(K))GOTO 2
      IF(WU.GE.WMAX)RETURN
C     Stopping rate eta=.6, cf. (A.14)
      IF(K.EQ.1.OR.ERR/ERRP.LT..6)GOTO 3
      CALL RESCAL(K,K+M,K+M-1)
C     Coarse-grid convergence factor delta=.3, cf. (A.15)
      EPS(K-1)=.3*ERR
      K=K-1
      CALL PUTZ(K)
      GOTO 5
    2 IF(K.EQ.M)RETURN
      CALL INTADD(K,K+1)
      K=K+1
      GOTO 5
      END


      SUBROUTINE GRDFN(N,IMAX,JMAX,HH)
C     Define an IMAX x JMAX array v(N)
      COMMON/GRD/NST(20),IMX(20),JMX(20),H(20)
      DATA IQ/1/
      NST(N)=IQ
      IMX(N)=IMAX
      JMX(N)=JMAX
      H(N)=HH
      IQ=IQ+IMAX*JMAX
      RETURN
      END

      SUBROUTINE KEY(K,IST,IMAX,JMAX,HH)
C     Set IST such that v(K)(I,J) = Q(IST(I)+J),
C     and set IMAX = IMX(K), JMAX = JMX(K), HH = H(K)
      COMMON/GRD/NST(20),IMX(20),JMX(20),H(20)
      DIMENSION IST(1)
      IMAX=IMX(K)
      JMAX=JMX(K)
      IS=NST(K)-JMAX-1
      DO 1 I=1,IMAX
      IS=IS+JMAX
    1 IST(I)=IS
      HH=H(K)
      RETURN
      END
      SUBROUTINE PUTF(K,F,NH)
C     v(K) <- H(K)**NH * F
      COMMON Q(18000),IST(600)
      CALL KEY(K,IST,II,JJ,H)
      H2=H**NH
      DO 1 I=1,II
      DO 1 J=1,JJ
      X=(I-1)*H
      Y=(J-1)*H
    1 Q(IST(I)+J)=F(X,Y)*H2
      RETURN
      END


      SUBROUTINE PUTZ(K)
C     v(K) <- 0
      COMMON Q(18000),IST(200)
      CALL KEY(K,IST,II,JJ,H)
      DO 1 I=1,II
      DO 1 J=1,JJ
    1 Q(IST(I)+J)=0.
      RETURN
      END


      SUBROUTINE RELAX(K,KRHS,ERR)
C     A Gauss-Seidel relaxation sweep on the 5-point equation
C     (h**2)*Delta_h v(K) = v(KRHS),
C     giving ERR = L2 norm of the (dynamic) residuals
      COMMON Q(18000),IST(200),IRHS(200)
      CALL KEY(K,IST,II,JJ,H)
      CALL KEY(KRHS,IRHS,II,JJ,H)
      I1=II-1
      J1=JJ-1
      ERR=0.
      DO 1 I=2,I1
      IR=IRHS(I)
      IO=IST(I)
      IM=IST(I-1)
      IP=IST(I+1)
      DO 1 J=2,J1
      A=Q(IR+J)-Q(IO+J+1)-Q(IO+J-1)-Q(IM+J)-Q(IP+J)
      ERR=ERR+(A+4.*Q(IO+J))**2
    1 Q(IO+J)=-.25*A
      ERR=SQRT(ERR)/H
      RETURN
      END

      SUBROUTINE INTADD(KC,KF)
C     Linear interpolation and addition:
C     v(KF) <- v(KF) + I(KC to KF) v(KC)
      COMMON Q(18000),ISTC(200),ISTF(200)
      CALL KEY(KC,ISTC,IIC,JJC,HC)
      CALL KEY(KF,ISTF,IIF,JJF,HF)
      DO 1 IC=2,IIC
      IF=2*IC-1
      JF=1
      IFO=ISTF(IF)
      IFM=ISTF(IF-1)
      ICO=ISTC(IC)
      ICM=ISTC(IC-1)
      DO 1 JC=2,JJC
      JF=JF+2
      A=.5*(Q(ICO+JC)+Q(ICO+JC-1))
      AM=.5*(Q(ICM+JC)+Q(ICM+JC-1))
      Q(IFO+JF)=Q(IFO+JF)+Q(ICO+JC)
      Q(IFM+JF)=Q(IFM+JF)+.5*(Q(ICO+JC)+Q(ICM+JC))
      Q(IFO+JF-1)=Q(IFO+JF-1)+A
    1 Q(IFM+JF-1)=Q(IFM+JF-1)+.5*(A+AM)
      RETURN
      END


      SUBROUTINE RESCAL(KF,KRF,KRC)
C     Residuals injection:
C     v(KRC) <- I(fine to coarse) of (v(KRF) - Delta_h v(KF)),
C     in the h**2-scaled form used by RELAX (hence the factor 4)
      COMMON Q(18000),IUF(200),IRF(200),IRC(200)
      CALL KEY(KF,IUF,IIF,JJF,HF)
      CALL KEY(KRF,IRF,IIF,JJF,HF)
      CALL KEY(KRC,IRC,IIC,JJC,HC)
      IIC1=IIC-1
      JJC1=JJC-1
      DO 1 IC=2,IIC1
      ICR=IRC(IC)
      IF=2*IC-1
      JF=1
      IFR=IRF(IF)
      IFO=IUF(IF)
      IFM=IUF(IF-1)
      IFP=IUF(IF+1)
      DO 1 JC=2,JJC1
      JF=JF+2
      S=Q(IFO+JF+1)+Q(IFO+JF-1)+Q(IFM+JF)+Q(IFP+JF)
    1 Q(ICR+JC)=4.*(Q(IFR+JF)-S+4.*Q(IFO+JF))
      RETURN
      END

 LEVEL 6 RESIDUAL NORM= 2.814E+01 WORK=  1.000
 LEVEL 6 RESIDUAL NORM= 2.764E+01 WORK=  2.000
 LEVEL 5 RESIDUAL NORM= 2.659E+01 WORK=  2.250
 LEVEL 5 RESIDUAL NORM= 2.555E+01 WORK=  2.500
 LEVEL 4 RESIDUAL NORM= 2.317E+01 WORK=  2.563
 LEVEL 4 RESIDUAL NORM= 2.095E+01 WORK=  2.625
 LEVEL 3 RESIDUAL NORM= 1.649E+01 WORK=  2.641
 LEVEL 3 RESIDUAL NORM= 1.285E+01 WORK=  2.656
 LEVEL 2 RESIDUAL NORM= 7.626E+00 WORK=  2.660
 LEVEL 2 RESIDUAL NORM= 3.840E+00 WORK=  2.664
 LEVEL 3 RESIDUAL NORM= 5.058E+00 WORK=  2.680
 LEVEL 4 RESIDUAL NORM= 8.006E+00 WORK=  2.742
 LEVEL 4 RESIDUAL NORM= 2.545E+00 WORK=  2.805
 LEVEL 5 RESIDUAL NORM= 9.736E+00 WORK=  3.055
 LEVEL 5 RESIDUAL NORM= 2.464E+00 WORK=  3.305
 LEVEL 6 RESIDUAL NORM= 1.064E+01 WORK=  4.305
 LEVEL 6 RESIDUAL NORM= 2.442E+00 WORK=  5.305
 LEVEL 6 RESIDUAL NORM= 2.399E+00 WORK=  6.305
 LEVEL 5 RESIDUAL NORM= 2.351E+00 WORK=  6.555
 LEVEL 5 RESIDUAL NORM= 2.303E+00 WORK=  6.805
 LEVEL 4 RESIDUAL NORM= 2.173E+00 WORK=  6.867
 LEVEL 4 RESIDUAL NORM= 2.043E+00 WORK=  6.930
 LEVEL 3 RESIDUAL NORM= 1.739E+00 WORK=  6.945
 LEVEL 3 RESIDUAL NORM= 1.453E+00 WORK=  6.961
 LEVEL 2 RESIDUAL NORM= 9.889E-01 WORK=  6.965
 LEVEL 2 RESIDUAL NORM= 6.183E-01 WORK=  6.969
 LEVEL 1 RESIDUAL NORM= 2.760E-01 WORK=  6.970
 LEVEL 1 RESIDUAL NORM= 5.170E-02 WORK=  6.971
 LEVEL 2 RESIDUAL NORM= 2.292E-01 WORK=  6.975
 LEVEL 3 RESIDUAL NORM= 5.465E-01 WORK=  6.990
 LEVEL 4 RESIDUAL NORM= 7.710E-01 WORK=  7.053
 LEVEL 4 RESIDUAL NORM= 1.163E-01 WORK=  7.115
 LEVEL 5 RESIDUAL NORM= 8.657E-01 WORK=  7.365
 LEVEL 5 RESIDUAL NORM= 1.058E-01 WORK=  7.615
 LEVEL 6 RESIDUAL NORM= 9.059E-01 WORK=  8.615
 LEVEL 6 RESIDUAL NORM= 1.052E-01 WORK=  9.615
 LEVEL 6 RESIDUAL NORM= 1.012E-01 WORK= 10.615
 LEVEL 5 RESIDUAL NORM= 9.759E-02 WORK= 10.865
 LEVEL 5 RESIDUAL NORM= 9.452E-02 WORK= 11.115
 LEVEL 4 RESIDUAL NORM= 8.710E-02 WORK= 11.178
 LEVEL 4 RESIDUAL NORM= 7.960E-02 WORK= 11.240
 LEVEL 3 RESIDUAL NORM= 6.389E-02 WORK= 11.256
 LEVEL 3 RESIDUAL NORM= 4.931E-02 WORK= 11.271
 LEVEL 2 RESIDUAL NORM= 2.916E-02 WORK= 11.275
 LEVEL 2 RESIDUAL NORM= 1.622E-02 WORK= 11.279
 LEVEL 2 RESIDUAL NORM= 1.017E-02 WORK= 11.283
 LEVEL 3 RESIDUAL NORM= 1.949E-02 WORK= 11.299
 LEVEL 4 RESIDUAL NORM= 3.128E-02 WORK= 11.361
 LEVEL 4 RESIDUAL NORM= 8.843E-03 WORK= 11.424
 LEVEL 5 RESIDUAL NORM= 3.710E-02 WORK= 11.674
 LEVEL 5 RESIDUAL NORM= 8.486E-03 WORK= 11.924
 LEVEL 6 RESIDUAL NORM= 4.007E-02 WORK= 12.924
 LEVEL 6 RESIDUAL NORM= 9.051E-03 WORK= 13.924
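The reduction per cycle and the cost per cycle can be read off the printed lines. The Python sketch below compares the residual norm at matching points of successive cycles, namely the second finest-grid sweep after each return to level 6 (WORK = 5.305, 9.615 and 13.924 in the listing). The choice of these checkpoints and the function name are assumptions made for this illustration.

```python
# (residual norm, accumulated work) at the second level-6 sweep
# following each return to the finest grid, copied from the output.
CHECKPOINTS = [(2.442e+00, 5.305), (1.052e-01, 9.615), (9.051e-03, 13.924)]


def cycle_statistics(points):
    """Per-cycle reduction factor, cost in work units, and rate per work unit."""
    stats = []
    for (r0, w0), (r1, w1) in zip(points, points[1:]):
        factor = r0 / r1
        cost = w1 - w0
        rate_per_unit = (r1 / r0) ** (1.0 / cost)
        stats.append((factor, cost, rate_per_unit))
    return stats


for factor, cost, rate in cycle_statistics(CHECKPOINTS):
    print(round(factor, 1), round(cost, 3), round(rate, 3))
```

Both cycles reduce the residual norm by more than a factor of 10 at a cost of about 4.3 work units each.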

OUTPUT. Error reduction by a factor greater than 10 per cycle. Each cycle costs 4.3 WU.
Insensitivity: Results would be practically the same for any $.005 \le \delta \le .5$ or any $0 < \eta < .65$.

APPENDIX C. RIGOROUS BOUND TO MODEL-PROBLEM CONVERGENCE RATE.

We consider the model problem: the 5-point Poisson equation on a $(n_1+1) \times (n_2+1)$ rectangular grid $G^M$ with Dirichlet boundary conditions. Let $n_j = 2^M N_j$ and let $G^k$ be the $(2^k N_1 + 1) \times (2^k N_2 + 1)$ uniform grid on the same domain, with mesh size $h_k = 2^{-k} h_0$ (k = 0, 1, ..., M). We will estimate the convergence rate and work in one multi-grid cycle $C^M$.


The $C^M$ cycle is defined inductively as follows: (i) Make r relaxation sweeps on the $G^M$ approximate solution $u^M$. To facilitate the rigorous Fourier analysis we choose as our relaxation the Weighted Simultaneous Displacement (WSD, or "weighted Jacobi") method with the optimal weights $\omega_{00} = 48/41$, $\omega_{01} = \omega_{0,-1} = \omega_{10} = \omega_{-1,0} = 8/41$ (see Sec. 3.3). (ii) Inject (cf. Sec. A.4) the residual problem to $G^{M-1}$. (iii) Get an approximate solution $v^{M-1}$ to this problem by two $C^{M-1}$ cycles, starting from the zero approximation. (iv) Correct $u^M \leftarrow u^M + I_{M-1}^M v^{M-1}$, where $I_{M-1}^M$ is linear interpolation.

It is easily calculated that one WSD sweep amplifies the Fourier component $\exp(i\theta\cdot x/h_M)$ of the residual by the factor

$\mu(\theta) = 1 - (2 - \cos\theta_1 - \cos\theta_2)(24 + 8\cos\theta_1 + 8\cos\theta_2)/41$.

Denote by $A(\theta)$ the amplitude, before the $C^M$ cycle, of the $\theta = (\theta_1, \theta_2)$ component of the residual. Actually present on the grid $G^M$ are only components of the form $\theta = (\alpha_1\pi/n_1, \alpha_2\pi/n_2)$, ($\alpha_j = \pm 1, \pm 2, \ldots, \pm(n_j - 1)$), and their amplitudes $A(\theta_1, \theta_2) = -A(\theta_1, -\theta_2) = -A(-\theta_1, \theta_2)$ are real (assuming two of the boundary lines to lie on the axes). Since $\mu(\theta_1, \theta_2) = \mu(\pm\theta_1, \pm\theta_2)$ is real, the r relaxation sweeps operate separately on each residual mode, transforming its amplitude $A(\theta)$ to $A'(\theta) = \mu(\theta)^r A(\theta)$.
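The amplification factor is easy to tabulate. The Python sketch below evaluates $\mu(\theta)$ and scans the high-frequency range $\pi/2 \le \max(|\theta_1|, |\theta_2|) \le \pi$ on a uniform sample of angles; by symmetry only non-negative angles are scanned. The function names and the sampling density are assumptions of this sketch. Writing $s = \cos\theta_1 + \cos\theta_2$, the factor equals $(8s^2 + 8s - 7)/41$, so the scan should reproduce the value 9/41.

```python
import math


def wsd_factor(t1, t2):
    """Amplification factor mu(theta) of one WSD sweep."""
    c1, c2 = math.cos(t1), math.cos(t2)
    return 1.0 - (2.0 - c1 - c2) * (24.0 + 8.0 * c1 + 8.0 * c2) / 41.0


def smoothing_factor(n=64):
    """max |mu(theta)| over pi/2 <= max(|theta1|, |theta2|) <= pi.

    Angles are sampled as a*pi/n, b*pi/n with 0 <= a, b <= n (n even).
    """
    worst = 0.0
    for a in range(n + 1):
        for b in range(n + 1):
            if 2 * max(a, b) >= n:
                mu = wsd_factor(a * math.pi / n, b * math.pi / n)
                worst = max(worst, abs(mu))
    return worst


print(smoothing_factor(), 9.0 / 41.0)
```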

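For any component $\theta = (\theta_1, \theta_2)$ such that $|\theta| = \max(|\theta_1|, |\theta_2|) \le \pi/2$, denote by $\theta^1 = \theta$, $\theta^2$, $\theta^3$, $\theta^4$ the four components obtained by replacing $\theta_1$ by $\theta_1 \pm \pi$ and/or $\theta_2$ by $\theta_2 \pm \pi$, where each $\pm$ sign is chosen so that $|\theta^\ell| < \pi$, ($\ell$ = 1, 2, 3, 4). Of these four "harmonics", only the $\theta^1$ mode appears on $G^{M-1}$, its amplitude there (in the right-hand side of the $G^{M-1}$ residual problem formed in Step (ii)) being

(C.1)  $A_\theta = A'(\theta^1) + A'(\theta^2) + A'(\theta^3) + A'(\theta^4)$.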


Let $E_k$ denote an upper bound to the factors by which any $C^k$ cycle reduces the $L_2$ norm of the residuals on $G^k$. In particular, the two $C^{M-1}$ cycles (Step (iii)) are equivalent to solving a $G^{M-1}$ problem with amplitudes $a_\theta$ instead of $A_\theta$, where

(C.2)  $\sum_{|\theta| \le \pi/2} (A_\theta - a_\theta)^2 \le E_{M-1}^4 \sum_{|\theta| \le \pi/2} A_\theta^2$.

Hence, interpolating the computed correction from $G^{M-1}$ to $G^M$ (Step (iv)), the new residual amplitudes are easily calculated to be

$\bar A(\theta^\ell) = A'(\theta^\ell) - S(\theta^\ell)\, a_\theta = \mu(\theta^\ell)^r A(\theta^\ell) - S(\theta^\ell) A_\theta + S(\theta^\ell)(A_\theta - a_\theta)$, ($\ell$ = 1, 2, 3, 4),

where

$S(\theta) = \dfrac{(1 + \cos\theta_1)(1 + \cos\theta_2)(4 - 2\cos\theta_1 - 2\cos\theta_2)}{4 - 2\cos 2\theta_1 - 2\cos 2\theta_2}$.

Hence

(C.3)  $\sum_\ell \bar A(\theta^\ell)^2 \le 2q^2 \sum_\ell A(\theta^\ell)^2 + 2 \sum_\ell S(\theta^\ell)^2 (A_\theta - a_\theta)^2$,

where q is any upper bound to the spectral radii of the 4x4 matrices $Q(\theta)$ defined by

$Q_{\ell m}(\theta) = (\delta_{\ell m} - S(\theta^\ell))\, \mu(\theta^m)^r$, ($1 \le \ell, m \le 4$).

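Denoting $\beta_j = 1 - \cos^2\theta_j$, it is easy to check that

(C.4)  $\sum_\ell S(\theta^\ell)^2$ = [expression in $\beta_1$, $\beta_2$ and its bound illegible].

Hence, summing (C.3) over the relevant range of $\theta$, using (C.2) and (C.4) and then (C.1), we obtain

$\sum_{|\theta| < \pi} \bar A(\theta)^2 \le 2q^2 \sum_{|\theta| < \pi} A(\theta)^2 +$ [coefficient illegible] $E_{M-1}^4 \sum_{|\theta| \le \pi/2} A_\theta^2 \le (2q^2 + \gamma E_{M-1}^4) \sum_{|\theta| < \pi} A(\theta)^2$,

where $\gamma$ is any upper bound to all $\sum_\ell \mu(\theta^\ell)^{2r}$, ($0 < |\theta| \le \pi/2$).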

Thus,  we  have  obtained  the  bound 


(C.5) 


2^2  4 


A  simple  comput^e^r  program  confirms  the  bounds  q^  =  (7/41)^  and 
M  — 
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The  number  of  operations  in  the  C  cycle  is  ^  (12r+3)n  n^  + 

Hence  by  Auction  L  M,  l(;24r.6)  n^n,.  He  thJ  have  an  snmary 

Theorem.  The  above  cycle  rednces.the  error  by  a  factor  .101  and 

costs  78  operations  (additions  and  multiplications)  per  (G^)  grid  po^. 

The  theorem  can  be  inproved  (to  .1  reduction  in  only  53  operations- 
aefinin,  tlTc®  cycle  to  consist  of  ftM  relanetion  sweeps 
LlT^ne  cycle,  and  choosing  large  r.  (Enploying  arbxtrarxly  large  r 

pays  only  with  simultaneous-displacement  schemes  on  rectangular  domains, 
^ere  there  is  no  feed  from  low  to  high  frequencies) . 

In  practice,  .1  reduction  is  obtained  in  about  26  operations.  (See 
App. B. The Gauss-Seidel sweep employed there can be done
per-point But for every 3 sweeps on G the interpolations I
So each costing an average of 6/4 operations per point.

a  work  unit  in  App.  B  should  be  considered  as  representing 

“SS'i  roperatioL).  These  operations  involve  only  additions  and 

shifts . 
SINGULAR VALUE DECOMPOSITION:
APPLICATIONS AND COMPUTATIONS

Gene  H.  Golub 
and 

Franklin  T.  Luk 

Stanford  University,  Stanford,  California 


ABSTRACT. The Singular Value Decomposition (SVD) of a rectangular
matrix is described. Several problems arising in data analysis are
presented and their solutions are given in terms of the SVD.
Numerical methods are discussed for computing the decomposition for
dense and sparse matrices.


1. INTRODUCTION. This paper is concerned with the singular
value decomposition of a given matrix. The decomposition is very
useful although it may not be as familiar as some of the other matrix
decompositions. We shall describe the decomposition, give some
specific examples of its applications, and suggest some methods to
compute the decomposition.

There are many matrix decompositions that are useful in mathematical
applications. A very familiar one is the QR decomposition of a square
matrix $A$:

$$A = QR ,$$

where $Q$ is an orthogonal matrix and $R$ is an upper triangular matrix.
There are several numerical schemes to compute this decomposition. We
could use the Gram-Schmidt method; the columns of $Q$ are the orthogonal
columns generated by the process. Another way to generate $Q$ and $R$
is through the use of Householder transformations.

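Another familiar decomposition is the reduction of a square
matrix to its Jordan canonical form:

$$A = XJX^{-1} ,$$

where $X$ is nonsingular and $J$ is a block diagonal matrix in which
each diagonal block is a Jordan block (a single eigenvalue repeated on
the diagonal, ones on the superdiagonal and zeros elsewhere).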

*This  work  was  in  part  supported  by  U.S.  Army  Research  Grant 
DAHC04-75-G-0185. 
This decomposition has been used extensively in the study of stability
of differential equations. Unfortunately, there does not appear to be
any good numerical algorithm to compute the decomposition (Golub and
Wilkinson [14]).

Finally, we shall discuss the singular value decomposition of
an $m \times n$ matrix $A$:

$$A = U \Sigma V^T ,$$

where $U$ is an $m \times m$ orthogonal matrix, $V$ is an $n \times n$
orthogonal matrix, and $\Sigma$ is an $m \times n$ matrix with
non-negative elements down the main diagonal and zeros everywhere else.
For our discussion, we shall assume that $A$ has at least as many rows
as columns so that $m \ge n$, although this is not always the case.
There are many proofs of this decomposition, for instance, in the book
by Forsythe and Moler [6]. A very clear and useful discussion is given
in the book by Lanczos, "Linear Differential Operators" [17].

It is not very difficult to see that $U$ consists of the
eigenvectors of $AA^T$, $V$ consists of the eigenvectors of $A^TA$, and
the diagonal elements $\sigma_i$, $1 \le i \le n$, of $\Sigma$ are the
non-negative square roots of the eigenvalues of $A^TA$. We assume the
$\sigma_i$'s are arranged in such a way that

$$\sigma_1 \ge \sigma_2 \ge \cdots \ge \sigma_r > 0 ,$$

where $r$ is the rank of the matrix $A$.

The singular values and the eigenvalues of a given matrix can
frequently differ. Consider an $m \times m$ matrix

$$A = \begin{pmatrix} 0 & 1 & & \\ & 0 & \ddots & \\ & & \ddots & 1 \\ & & & 0 \end{pmatrix} .$$

The matrix $A$ is of rank $m-1$ but all its eigenvalues equal 0. However,
$(m-1)$ singular values of $A$ equal 1 and only one singular value is
zero. Hence the number of non-zero eigenvalues of a matrix gives a lower
bound on its rank, whereas the number of non-zero singular values of
a matrix is its rank.
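
The contrast between eigenvalues and singular values can be checked numerically. The following is a minimal sketch, assuming NumPy and taking the shift matrix (ones on the superdiagonal) as the example; the variable names are illustrative only.

```python
import numpy as np

m = 5
# Assumed example: the m x m shift matrix, with ones on the superdiagonal.
A = np.diag(np.ones(m - 1), k=1)

eigenvalues = np.linalg.eigvals(A)
singular_values = np.linalg.svd(A, compute_uv=False)   # sorted, largest first

# Singular values are the square roots of the eigenvalues of A^T A.
eig_AtA = np.sort(np.linalg.eigvalsh(A.T @ A))[::-1]

# The rank equals the number of non-zero singular values.
rank_from_svd = int(np.sum(singular_values > 1e-12))
```

Here all eigenvalues vanish, so counting them gives only the trivial lower bound 0 on the rank, while the singular values reveal the rank m - 1.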


2. APPLICATIONS. In this section we shall discuss some
applications of the singular value decomposition (cf. Golub [8]).

A. Let $\mathcal{O}_m$ be the set of all $m \times m$ orthogonal matrices.
We wish to replace a given $m \times m$ matrix $A$ by an $m \times m$
orthogonal matrix $Q$ that is near $A$. In order to study the nearness of
one matrix with respect to another matrix, we introduce a norm; we use
the Frobenius norm of a matrix,

$$\|A\| = \Big( \sum_{i,j} |a_{ij}|^2 \Big)^{1/2} .$$

We shall use this matrix norm throughout this discussion. Our problem
then consists of the following: let $A$ be an arbitrary $m \times m$
matrix; determine $Q \in \mathcal{O}_m$ such that

$$\|A - Q\| \le \|A - X\| \quad \text{for any } X \in \mathcal{O}_m .$$


This problem is important in factor analysis and has also found
applications in aeronautics (cf. Bar-Itzhack [1]).

The solution to the problem is fairly simple. It is as follows:
if

$$A = U \Sigma V^T ,$$

then we replace all the singular values by 1 and write

$$Q = U I V^T .$$

It is well known that the singular values of an orthogonal matrix all
equal 1. Now,

$$\|A - Q\| = \|U \Sigma V^T - U I V^T\| = \|\Sigma - I\|$$

since the Frobenius norm is unitarily invariant†, so that

$$\|A - Q\| = [(\sigma_1 - 1)^2 + (\sigma_2 - 1)^2 + \cdots + (\sigma_m - 1)^2]^{1/2} .$$

This value then is a measure of the departure from orthogonality of a
given matrix. The result is true for all unitarily invariant norms
(Fan and Hoffman [5]).
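
The construction of the nearest orthogonal matrix can be sketched as follows, assuming NumPy; the helper name `nearest_orthogonal` and the test matrix are illustrative and not taken from the paper.

```python
import numpy as np

def nearest_orthogonal(A):
    """Return Q = U V^T (all singular values replaced by 1) and the singular values of A."""
    U, sigma, Vt = np.linalg.svd(A)
    return U @ Vt, sigma

rng = np.random.default_rng(0)
A = np.eye(3) + 0.1 * rng.standard_normal((3, 3))   # a nearly orthogonal test matrix
Q, sigma = nearest_orthogonal(A)

# Departure from orthogonality, measured two ways.
distance = np.linalg.norm(A - Q, "fro")
predicted = np.sqrt(np.sum((sigma - 1.0) ** 2))
```

The two quantities `distance` and `predicted` should agree to rounding error, which is the identity derived above.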

B.  We  consider  the  following  important  generalization  of 
problem  A.  Let  A  be  an  m  X  n  matrix  associated  with  a  set  of  data 
and  let  B  be  obtained  from  A  through  a  rotation  of  the  data.  The 
following  figure  may  represent  a  typical  situation: 


[Figure: the two data sets A and B.]

† A norm is said to be unitarily invariant if
$\|AU\| = \|VA\| = \|A\|$ where $U^*U = I$ and $V^*V = I$.
Our idea is to replace $A$ by $BQ$, that is, we wish to replace $A$ by
a rotation of $B$. We want to determine an orthogonal matrix $Q$ such that

$$\|A - BQ\| = \min .$$

The solution is again given in terms of the singular value decomposition.
Green [15] and Schonemann [21] showed that if

$$B^T A = U \Sigma V^T$$

and

$$Q = U V^T ,$$

then

$$\|A - BQ\| \le \|A - BX\| \quad \text{for all orthogonal matrices } X .$$
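
As an illustration of this rotation problem, here is a minimal sketch assuming NumPy. The function name `procrustes_rotation` and the synthetic data are assumptions made for the example.

```python
import numpy as np

def procrustes_rotation(A, B):
    """Orthogonal Q minimising ||A - B Q||_F, obtained from the SVD of B^T A."""
    U, _, Vt = np.linalg.svd(B.T @ A)
    return U @ Vt

rng = np.random.default_rng(1)
B = rng.standard_normal((20, 3))
Q_true, _ = np.linalg.qr(rng.standard_normal((3, 3)))
A = B @ Q_true + 1e-3 * rng.standard_normal((20, 3))   # rotated data plus small noise

Q = procrustes_rotation(A, B)
residual = np.linalg.norm(A - B @ Q, "fro")
```

With small noise the recovered `Q` should be close to `Q_true`, and its residual can never exceed that of any other orthogonal matrix.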


C. Let $\mathcal{M}_{m,n}^{(k)}$ be the set of all $m \times n$ matrices
of rank $k$. Assume $A \in \mathcal{M}_{m,n}^{(r)}$. We want to determine
$B \in \mathcal{M}_{m,n}^{(k)}$ $(k \le r)$ such that

$$\|A - B\| \le \|A - X\| \quad \text{for all } X \in \mathcal{M}_{m,n}^{(k)} .$$

In other words, we want to approximate the matrix $A$ with a matrix of
lower rank and we want the best approximation for the fixed rank. The
solution is given in terms of the singular value decomposition.

Let $A = U \Sigma V^T$, then $B = U \Sigma_k V^T$ where

$$\Sigma_k = \mathrm{diag}(\sigma_1, \ldots, \sigma_k, 0, \ldots, 0) .$$

Now

$$\|A - B\| = (\sigma_{k+1}^2 + \cdots + \sigma_r^2)^{1/2} .$$

Mirsky [18] showed that the above result is true for all
unitarily invariant norms.
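
A short numerical sketch of the best rank-k approximation, assuming NumPy; `best_rank_k` is a hypothetical helper name and the random matrix is only a test case.

```python
import numpy as np

def best_rank_k(A, k):
    """Keep the k largest singular values of A and zero out the rest."""
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    return (U[:, :k] * s[:k]) @ Vt[:k, :], s

rng = np.random.default_rng(2)
A = rng.standard_normal((8, 5))
k = 2
B, s = best_rank_k(A, k)

error = np.linalg.norm(A - B, "fro")
predicted = np.sqrt(np.sum(s[k:] ** 2))   # root sum of squares of discarded singular values
```

The approximation error depends only on the discarded singular values, so a rapidly decaying spectrum means a low-rank matrix represents the data well.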

Consider  the  following  example.  Let 


Mathematically, the matrix is of rank 2. But the following rank 1
matrix 

differs  from  A  by  only  10  ^  and  is  the  closest  matrix  of  rank  1  to  A. 


D. The singular value decomposition also enters in the computation
of the pseudo-inverse of a matrix. An $n \times m$ matrix $X$ is a
pseudo-inverse of an $m \times n$ matrix $A$ if it satisfies the following
four relations:

(i) $AXA = A$,

(ii) $XAX = X$,

(iii) $(AX)^T = AX$,

(iv) $(XA)^T = XA$.

The pseudo-inverse $X$ is unique and we denote it by $A^+$. We can easily
verify that given

$$A = U \Sigma V^T ,$$

we always have

$$A^+ = V \Lambda U^T ,$$

where $\Lambda$ is the $n \times m$ matrix with diagonal elements

$$\lambda_i = \begin{cases} 1/\sigma_i & \text{for } \sigma_i > 0 , \\ 0 & \text{for } \sigma_i = 0 , \end{cases}$$

and zeros everywhere else.
Consider the following problem. Suppose we have an $m$-vector
$b$ and an $m \times n$ matrix $A$. We would like to determine an
$n$-vector $x$ such that

$$\|Ax - b\|_2 = \min .‡$$

If $A$ is not a matrix of full rank, we do not have a unique solution
to the problem. Let

$$\mathcal{X} = \{ x : \|Ax - b\|_2 = \min \} ;$$

we would like to determine $\hat{x} \in \mathcal{X}$ such that
$\|\hat{x}\|_2$ is a minimum.

‡ $\|y\|_2 = \big( \sum_{i=1}^{n} y_i^2 \big)^{1/2}$ for $y = (y_1, y_2, \ldots, y_n)^T$.

The solution is given by $\hat{x} = A^+ b$. Hence if we had $A^+$, it
would be fairly simple to compute a sequence of solutions $\{x_i\}$ given
the sequence of data $\{b_i\}$.
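
The pseudo-inverse and the minimum-norm least squares solution can be sketched as follows, assuming NumPy. The helper `pinv_svd` and its relative tolerance `tol` are assumptions of this example; the paper does not prescribe how to decide that a computed singular value is zero.

```python
import numpy as np

def pinv_svd(A, tol=1e-12):
    """Pseudo-inverse V Lambda U^T; singular values below tol * sigma_1 are treated as zero."""
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    inv_s = np.zeros_like(s)
    keep = s > tol * s[0]
    inv_s[keep] = 1.0 / s[keep]
    return (Vt.T * inv_s) @ U.T

rng = np.random.default_rng(3)
A = rng.standard_normal((6, 2)) @ rng.standard_normal((2, 4))   # 6 x 4 matrix of rank 2
b = rng.standard_normal(6)

X = pinv_svd(A)
x_min = X @ b          # minimum-norm minimiser of ||Ax - b||_2
```

Adding any null-space vector of `A` to `x_min` leaves the residual unchanged but increases the norm of the solution, which is why `x_min` is singled out.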

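Unfortunately, the pseudo-inverse of a matrix is not a continuous
function of the elements of $A$. If we let

$$A(\epsilon) = \begin{pmatrix} 1 & 0 \\ 0 & \epsilon \end{pmatrix} ,$$

where $\epsilon > 0$, then

$$A^+(\epsilon) = \begin{pmatrix} 1 & 0 \\ 0 & 1/\epsilon \end{pmatrix} .$$

But

$$A^+(0) = \begin{pmatrix} 1 & 0 \\ 0 & 0 \end{pmatrix} .$$

Hence for a small positive $\epsilon$, we see that $A^+(\epsilon)$ is
quite different from $A^+(0)$. Thus the computation of the
pseudo-inverse is quite an ill-conditioned problem.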

If we want to compute the pseudo-inverse in a stable way, we
must impose some additional conditions. We shall give one possibility
which seems quite satisfactory.

Suppose we are given a matrix $A$ but we also know that the
matrix is really some matrix $B$ plus some perturbation $\Delta$, viz.,

$$A = B + \Delta .$$

We do not know $B$ but we know some bound on the errors

$$\|\Delta\| \le \eta ;$$

for example, this would happen if the elements of $A$ were empirical
data with known uncertainties. We wish to determine $\hat{B}$ such that

$$\|A - \hat{B}\| \le \eta$$

and

$$\mathrm{rank}(\hat{B}) = \min .$$

The solution is given by the singular value decomposition. If we write

$$B_k = U \Sigma_k V^T ,$$

where $\Sigma_k$ retains the $k$ largest singular values
$\sigma_1, \ldots, \sigma_k$ of $\Sigma$ and sets the others to zero, then

$$\hat{B} = B_p$$

if

$$\sigma_{p+1}^2 + \cdots + \sigma_r^2 \le \eta^2$$

and

$$\sigma_p^2 + \sigma_{p+1}^2 + \cdots + \sigma_r^2 > \eta^2 .$$

Note that although

$$\|A - \hat{B}\| \le \eta ,$$

yet

$$\|A^+ - \hat{B}^+\| = \Big[ \frac{1}{\sigma_{p+1}^2} + \cdots + \frac{1}{\sigma_r^2} \Big]^{1/2} .$$
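
The rank decision above, and the large gap it produces between the two pseudo-inverses, can be illustrated with a small sketch assuming NumPy. The function name `lowest_rank_within` and the diagonal test matrix are hypothetical choices for the example.

```python
import numpy as np

def lowest_rank_within(A, eta):
    """Smallest p, and B_p, such that ||A - B_p||_F <= eta."""
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    # tail[p] = sqrt(s[p]^2 + ... + s[-1]^2) = ||A - B_p||_F, with tail[len(s)] = 0
    tail = np.sqrt(np.cumsum(s[::-1] ** 2)[::-1])
    tail = np.append(tail, 0.0)
    p = int(np.argmax(tail <= eta))          # first index where the tail fits within eta
    B = (U[:, :p] * s[:p]) @ Vt[:p, :]
    return B, p

A = np.diag([1.0, 0.5, 1e-6])                # one singular value is at noise level
B, p = lowest_rank_within(A, eta=1e-4)

# A and B are close, yet their pseudo-inverses are far apart.
gap = np.linalg.norm(np.linalg.pinv(A) - np.linalg.pinv(B), "fro")
```

In this example `A` and `B` differ by only 1e-6 while their pseudo-inverses differ by about 1e6, the reciprocal of the discarded singular value.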


E. We may use the singular value decomposition to solve
homogeneous equations. Suppose $A$ is an $m \times n$ matrix of rank $r$. Let

$$AV = U\Sigma .$$

We partition $V$ into an $n \times r$ matrix $V_1$ and an
$n \times (n-r)$ matrix $V_2$, i.e.

$$V = (V_1, V_2) ,$$

and

$$A(V_1, V_2) = (U\Sigma_1, 0) ,$$

where $\Sigma_1$ is the $m \times r$ matrix formed by the first $r$
columns of $\Sigma$. Then

$$A V_2 = 0 ,$$

and we have found an orthogonal basis for the null space of $A$.
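
A minimal sketch of extracting a null-space basis from the SVD, assuming NumPy. The helper name `null_space_basis`, the rank tolerance `tol` and the test matrix are assumptions made for illustration.

```python
import numpy as np

def null_space_basis(A, tol=1e-10):
    """Return V_2 (orthonormal columns spanning the null space of A) and the numerical rank r."""
    U, s, Vt = np.linalg.svd(A)              # the full n x n matrix V is needed here
    r = int(np.sum(s > tol * s[0]))
    return Vt[r:].T, r                       # V_2 has shape n x (n - r)

A = np.array([[1.0, 2.0, 3.0],
              [2.0, 4.0, 6.0],
              [1.0, 0.0, 1.0],
              [0.0, 1.0, 1.0]])              # third column = first + second, so rank 2
V2, r = null_space_basis(A)
```

Every column `v` of `V2` solves the homogeneous system `A v = 0`, and the columns are mutually orthonormal.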

Given a set of eigenvalues of a square matrix, we need to
solve a set of homogeneous equations in order to find the eigenvectors.
Golub and Wilkinson [14] used the idea to compute the Jordan canonical
form of a matrix.

Often, we wish to know which columns of a given matrix $A$ are
linearly independent. If $A$ is a set of measurements and if some
columns are dependent, we may want to determine which are the dependent
columns, eliminate them and obtain a linearly independent set of
measurements. The singular value decomposition can be very effective
for this purpose.

/  ^  \ 

Let  A  ^  ^  let  the  last  column  of  A  consist  of 

all  zeros.  We  find 


from  which  we  see  we  should  eliminate  the  last  column  of  A. 

In general, we want to take $V_2$ and perform Gaussian elimination
with complete pivoting on $V_2^T$ such that

$$P V_2^T \Pi = (R, S) ,$$

where

$R$ is an $(n-r) \times (n-r)$ upper triangular matrix,
$S$ is an $(n-r) \times r$ matrix, and
$\Pi$ is an $n \times n$ permutation matrix.

Then if

$$A\Pi = (A_1, A_2) ,$$

we can decide that the columns of $A_2$ form a linearly independent
basis for the columns of $A$. This and other problems of dependence
are discussed extensively in a paper by Golub, Klema and Stewart [10].


F. Another problem is the following. Consider

$$\max_{x,\,y} \frac{x^T A y}{\|x\|_2 \, \|y\|_2} .$$

It is not difficult to see that the maximal value of the normalized
bilinear form is $\sigma_1$, which is attained when $x = u_1$, $y = v_1$,
where $\sigma_1$ is the largest singular value of $A$, and $u_1$, $v_1$
are the corresponding left and right singular vectors, respectively.

Let $X$ be an $m \times s$ matrix and $Y$ be an $m \times t$ matrix.
Consider

$$\xi = Xu \quad \text{and} \quad \eta = Yv .$$

The angle $\theta$ between $\xi$ and $\eta$ is given by

$$\cos\theta = \frac{\xi^T \eta}{\|\xi\|_2 \, \|\eta\|_2} .$$

We can choose $\xi$ and $\eta$ to maximize the normalized inner product.
We call the maximal value the canonical correlation and the
corresponding angle (say $\hat\theta$) the angle between the two
subspaces spanned by the columns of $X$ and of $Y$.

We can determine $\hat\theta$ very easily using the singular value
decomposition. We compute the QR decomposition of $X$ and $Y$, viz.

$$X = QR \quad \text{and} \quad Y = PS .$$

Then

$$\hat\theta = \cos^{-1}\big(\sigma_{\max}(Q^T P)\big) .$$

The computation can be carried out even when $X$ and $Y$ have less than
full rank (Bjorck and Golub [2]).
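
The angle computation can be sketched as below, assuming NumPy. The function name `subspace_angle` is hypothetical, and this version assumes `X` and `Y` have full column rank; the rank-deficient case treated by Bjorck and Golub is not handled here.

```python
import numpy as np

def subspace_angle(X, Y):
    """Smallest angle between range(X) and range(Y), assuming full column rank."""
    Q, _ = np.linalg.qr(X)                   # orthonormal basis for range(X)
    P, _ = np.linalg.qr(Y)                   # orthonormal basis for range(Y)
    sigma_max = np.linalg.svd(Q.T @ P, compute_uv=False)[0]
    return np.arccos(min(sigma_max, 1.0))    # clip guards against rounding above 1

X = np.array([[1.0], [0.0], [0.0]])                   # the x-axis
Y = np.array([[1.0, 0.0], [1.0, 0.0], [0.0, 1.0]])    # plane spanned by (1,1,0) and (0,0,1)
theta = subspace_angle(X, Y)
```

For this choice the x-axis makes an angle of 45 degrees with the plane, so `theta` should be close to pi/4.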


G. One further application of the singular value decomposition
is in computing the parameter $\lambda$ in ridge regression using the
cross validation technique (Golub, Wahba and Heath [13]).

Given an $m \times n$ matrix $K$ of rank $r$ and an $m$-vector $g$, we
wish to minimize

$$\varphi(f) = \|g - Kf\|_2^2 + \lambda \|f\|_2^2 .$$

Using the variational technique, we see $\varphi(f)$ attains its minimum
at $f = \hat{f}$ where $\hat{f}$ satisfies

$$(K^T K + \lambda I)\hat{f} = K^T g .$$

Hence we have a ridge regression problem. The question is how to
choose $\lambda$. One possibility is to try to estimate $\lambda$ from
the data; we shall describe one method based on cross validation. We
shall see how the singular value decomposition of $K$ aids us in both
choosing $\lambda$ and solving for $f$ for the chosen value of $\lambda$.

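Let $K^{(j)}$ denote the $(m-1) \times n$ matrix obtained by leaving
out the $j$-th row of $K$, and let $g^{(j)}$ denote the $(m-1)$-vector
obtained by leaving out the $j$-th component of $g$, viz.

$$K^{(j)} = (k_1, \ldots, k_{j-1}, k_{j+1}, \ldots, k_m)^T$$

and

$$g^{(j)} = (g_1, \ldots, g_{j-1}, g_{j+1}, \ldots, g_m)^T ,$$

where $k_i^T$ is the $i$-th row of $K$. Let $f^{(j)}(\lambda)$ denote the
solution to

$$(K^{(j)T} K^{(j)} + \lambda I) f = K^{(j)T} g^{(j)} .$$

The cross-validation weighted square error $CV(\lambda)$ is defined by

$$CV(\lambda) = \sum_{j=1}^{m} w_j [g_j - k_j^T f^{(j)}(\lambda)]^2 ,$$

where $w_j > 0$.

We wish to choose $\lambda$ such that $CV(\lambda)$ is a minimum. We see

$$CV(\lambda) = \sum_{j=1}^{m} w_j [g_j - k_j^T (K^T K + \lambda I - k_j k_j^T)^{-1} K^T (g - g_j e_j)]^2 ,$$

where $e_j = (0, \ldots, 0, 1, 0, \ldots, 0)^T$ has its 1 in the $j$-th
position.

We apply the Sherman-Morrison formula to obtain

$$(K^T K + \lambda I - k_j k_j^T)^{-1} = (K^T K + \lambda I)^{-1} + \alpha_j^{-1} (K^T K + \lambda I)^{-1} k_j k_j^T (K^T K + \lambda I)^{-1} ,$$

where

$$\alpha_j = 1 - k_j^T (K^T K + \lambda I)^{-1} k_j , \qquad \alpha_j \ne 0 \text{ by assumption} .$$

After some additional computations, we get

$$CV(\lambda) = \| B(\lambda) [I - K(K^T K + \lambda I)^{-1} K^T] g \|_2^2 ,$$

where $B(\lambda)$ is an $m \times m$ matrix given by

$$\{B(\lambda)\}_{jj} = w_j^{1/2} [1 - k_j^T (K^T K + \lambda I)^{-1} k_j]^{-1} ,$$

and $\{B(\lambda)\}_{ij} = 0$ for $i \ne j$.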


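We factorize $K$ as

$$K = U \Sigma V^T ,$$

i.e., the singular value decomposition of $K$. Then

$$CV(\lambda) = \| B(\lambda) [g - U\Sigma(\Sigma^T \Sigma + \lambda I)^{-1} \Sigma^T \hat{g}] \|_2^2 ,$$

where $\hat{g} = U^T g$. Now,

$$\{B(\lambda)\}_{jj} = w_j^{1/2} [1 - k_j^T V (\Sigma^T \Sigma + \lambda I)^{-1} V^T k_j]^{-1} .$$

But since

$$KV = U\Sigma ,$$

we obtain

$$\{B(\lambda)\}_{jj} = w_j^{1/2} \Big( 1 - \sum_{i=1}^{r} u_{ji}^2 \, \varphi_i(\lambda) \Big)^{-1} ,$$

where

$$\varphi_i(\lambda) = \frac{\sigma_i^2}{\sigma_i^2 + \lambda} ,$$

and the $\sigma_i$'s are the singular values of $K$. Finally,

$$CV(\lambda) = \sum_{j=1}^{m} w_j \left[ \frac{g_j - \sum_{i=1}^{r} u_{ji} \, \varphi_i(\lambda) \, \hat{g}_i}{1 - \sum_{i=1}^{r} u_{ji}^2 \, \varphi_i(\lambda)} \right]^2 ,$$

which is very easy to evaluate.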

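For a chosen value of $\lambda$, we may solve the ridge regression
problem easily using the singular value decomposition of $K$. We have

$$(K^T K + \lambda I) f = K^T g ,$$

which reduces to

$$V(\Sigma^T \Sigma + \lambda I) V^T f = V \Sigma^T \hat{g} , \qquad \hat{g} = U^T g .$$

Hence

$$f = V(\Sigma^T \Sigma + \lambda I)^{-1} \Sigma^T \hat{g} = \sum_{j=1}^{r} \frac{\sigma_j \hat{g}_j}{\sigma_j^2 + \lambda} \, v_j ,$$

where

$$V = (v_1, v_2, \ldots, v_n) .$$

Many numerical experiments have been carried out in [13].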
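
The closed form for CV(lambda) can be compared against an explicit leave-one-out loop. The sketch below assumes NumPy; the function names `cv_svd`, `cv_brute_force` and `ridge_solution`, the unit default weights and the random test data are all assumptions of this example, not part of [13].

```python
import numpy as np

def cv_svd(K, g, lam, w=None):
    """CV(lambda) evaluated from the SVD of K, without refitting."""
    w = np.ones(K.shape[0]) if w is None else w
    U, s, _ = np.linalg.svd(K, full_matrices=False)
    phi = s**2 / (s**2 + lam)                 # phi_i(lambda)
    fitted = U @ (phi * (U.T @ g))            # sum_i u_ji phi_i g_hat_i
    leverage = (U**2) @ phi                   # sum_i u_ji^2 phi_i
    return np.sum(w * ((g - fitted) / (1.0 - leverage)) ** 2)

def cv_brute_force(K, g, lam, w=None):
    """CV(lambda) from its definition: refit with each row left out in turn."""
    m, n = K.shape
    w = np.ones(m) if w is None else w
    total = 0.0
    for j in range(m):
        keep = np.arange(m) != j
        Kj, gj = K[keep], g[keep]
        f_j = np.linalg.solve(Kj.T @ Kj + lam * np.eye(n), Kj.T @ gj)
        total += w[j] * (g[j] - K[j] @ f_j) ** 2
    return total

def ridge_solution(K, g, lam):
    """f = sum_j sigma_j g_hat_j / (sigma_j^2 + lambda) v_j."""
    U, s, Vt = np.linalg.svd(K, full_matrices=False)
    return Vt.T @ (s * (U.T @ g) / (s**2 + lam))

rng = np.random.default_rng(6)
K = rng.standard_normal((12, 4))
g = rng.standard_normal(12)
lam = 0.3
```

Once the SVD is available, each further evaluation of `cv_svd` costs only a few matrix-vector products, so scanning many values of lambda is cheap compared with the brute-force loop.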
3. COMPUTING THE SINGULAR VALUE DECOMPOSITION OF A DENSE MATRIX.
Our basic tool is the Householder transformation. Consider a matrix
of the form

$$P = I - 2ww^T ,$$

where

$$w^T w = 1 .$$

Note that the matrix $P$ is symmetric and orthogonal. Let $A^{(1)}$
denote the original matrix. We construct $P^{(1)}$ to annihilate all
elements below the diagonal in the first column of $A^{(1)}$:

$$A^{(3/2)} = P^{(1)} A^{(1)} .$$

We next apply a Householder transformation $Q^{(1)}$ on the right of
$A^{(3/2)}$, and our idea is to eliminate all elements to the right of the
(1,2) position in the first row of $A^{(3/2)}$ without disturbing the zero
elements in the first column:

$$A^{(2)} = A^{(3/2)} Q^{(1)} .$$

Our process continues with

$$A^{(k+1/2)} = P^{(k)} A^{(k)} ,$$

where the effect of $P^{(k)}$ is to eliminate all elements below the
diagonal in the $k$-th column of $A^{(k)}$, and with

$$A^{(k+1)} = A^{(k+1/2)} Q^{(k)} ,$$

where the effect of $Q^{(k)}$ is to eliminate all elements to the right
of the $(k,k+1)$ position in the $k$-th row of $A^{(k+1/2)}$.

The end result is that we have $n$ transformations on the left
($(n-1)$ transformations if $m = n$), and $(n-2)$ transformations on the
right of $A$:

$$P^{(n)} \cdots P^{(1)} \, A \, Q^{(1)} \cdots Q^{(n-2)} = J ,$$

where $J$ is an upper bidiagonal matrix.


We now apply the QR method due to Francis [7] and Kublanovskaya [16]
(Golub and Kahan [9]) so that

$$J = X \Sigma Y^T ,$$

i.e., the singular value decomposition of $J$. If we write

$$P = P^{(1)} P^{(2)} \cdots P^{(n)}$$

and

$$Q = Q^{(1)} Q^{(2)} \cdots Q^{(n-2)} ,$$

then

$$A = PJQ^T = (PX) \Sigma (QY)^T = U \Sigma V^T ,$$

where

$$U = PX , \qquad V = QY .$$

The first program to do the above computations was given by
Golub and Reinsch [12]. A version for complex matrices was given by
Businger and Golub [3]. A program for real matrices is available in
Release 2 of EISPACK [4].
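
The reduction to bidiagonal form can be sketched in a few lines, assuming NumPy. This is an illustrative dense implementation with hypothetical helper names (`householder_vector`, `bidiagonalize`); it is not the Golub-Reinsch or EISPACK code, and it forms P and Q explicitly for clarity rather than efficiency.

```python
import numpy as np

def householder_vector(x):
    """Unit vector w such that (I - 2 w w^T) x is a multiple of e_1, or None if x = 0."""
    norm_x = np.linalg.norm(x)
    if norm_x == 0.0:
        return None                          # nothing to annihilate
    v = x.astype(float).copy()
    v[0] += np.copysign(norm_x, x[0])        # sign chosen to avoid cancellation
    return v / np.linalg.norm(v)

def bidiagonalize(A):
    """Return P, J, Q with A = P J Q^T and J upper bidiagonal (assumes m >= n)."""
    J = A.astype(float).copy()
    m, n = J.shape
    P, Q = np.eye(m), np.eye(n)
    for k in range(n):
        # Left transformation: zero column k below the diagonal.
        w = householder_vector(J[k:, k])
        if w is not None:
            J[k:, :] -= 2.0 * np.outer(w, w @ J[k:, :])
            P[:, k:] -= 2.0 * np.outer(P[:, k:] @ w, w)
        # Right transformation: zero row k to the right of position (k, k+1).
        if k < n - 2:
            w = householder_vector(J[k, k + 1:])
            if w is not None:
                J[:, k + 1:] -= 2.0 * np.outer(J[:, k + 1:] @ w, w)
                Q[:, k + 1:] -= 2.0 * np.outer(Q[:, k + 1:] @ w, w)
    return P, J, Q

rng = np.random.default_rng(7)
A = rng.standard_normal((6, 4))
P, J, Q = bidiagonalize(A)
```

Because P and Q are orthogonal, `J` has the same singular values as `A`, so the iterative QR phase only has to work on a bidiagonal matrix.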


4. COMPUTING THE SINGULAR VALUE DECOMPOSITION OF LARGE
SPARSE MATRICES. We have several possibilities for computing the
singular value decomposition of a large and sparse matrix. In most
problems, we want only the few greatest singular values of a large
matrix; for instance, in image reconstruction, the order of the matrix
frequently exceeds 10,000 but only very few, generally less than 100,
of the greatest singular values are of physical significance.

A. Standard Lanczos algorithm. The best available algorithm
for computing a few of the greatest singular values of a large sparse
matrix, say $A$, is the Lanczos algorithm. The algorithm uses the matrix
$A$ only in the computation of the matrix-vector product $Ax$ or $A^T x$
given a vector $x$. Hence we can use the sparsity of $A$ to compute
the products very efficiently. Unlike other methods that transform
the matrix, the Lanczos algorithm preserves the matrix's sparse
structure and works well even if the matrix is so large that it has to
be stored on some auxiliary device (e.g. magnetic disk or tape).

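We use the Lanczos algorithm to bidiagonalize a given $m \times n$
matrix $A$:

$$A = PJQ^T ,$$

where

$$P^T P = Q^T Q = I$$

and $J$ is an upper bidiagonal matrix with diagonal elements
$\alpha_1, \ldots, \alpha_n$ and superdiagonal elements
$\beta_1, \ldots, \beta_{n-1}$.

We can expand the two resultant equations

$$AQ = PJ \quad \text{and} \quad A^T P = QJ^T$$

in terms of the columns $p_i$ of $P$ and $q_i$ of $Q$ to yield

$$A q_1 = \alpha_1 p_1 ,$$

$$A^T p_i = \alpha_i q_i + \beta_i q_{i+1} ,$$

$$A q_{i+1} = \beta_i p_i + \alpha_{i+1} p_{i+1} , \qquad i = 1, 2, \ldots, n-1 .$$

So our algorithm is

(1) Choose $q_1$ such that $\|q_1\|_2 = 1$. Set

$$w_1 = A q_1 , \qquad \alpha_1 = \|w_1\|_2 , \qquad p_1 = \alpha_1^{-1} w_1 .$$

(2) For $i = 1, 2, \ldots, s-1$ $(2 \le s \le n)$, compute

$$z_i = A^T p_i - \alpha_i q_i ,$$

$$\beta_i = \|z_i\|_2 ,$$

$$q_{i+1} = \beta_i^{-1} z_i ,$$

$$w_{i+1} = A q_{i+1} - \beta_i p_i ,$$

$$\alpha_{i+1} = \|w_{i+1}\|_2 ,$$

$$p_{i+1} = \alpha_{i+1}^{-1} w_{i+1} .$$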

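For some $s \le n$ we denote

$$J^{(s)} = \begin{pmatrix} \alpha_1 & \beta_1 & & \\ & \alpha_2 & \ddots & \\ & & \ddots & \beta_{s-1} \\ & & & \alpha_s \end{pmatrix} ,$$

and

$$P^{(s)} = (p_1, \ldots, p_s) , \qquad Q^{(s)} = (q_1, q_2, \ldots, q_s) .$$

We now apply the QR method on $J^{(s)}$ so that

$$J^{(s)} = X^{(s)} \Sigma^{(s)} Y^{(s)T} ,$$

i.e., the singular value decomposition of $J^{(s)}$. Let

$$\Sigma^{(s)} = \mathrm{diag}(\sigma_1^{(s)}, \ldots, \sigma_s^{(s)}) ,$$

where $\sigma_1^{(s)} \ge \cdots \ge \sigma_s^{(s)} \ge 0$,

$$X^{(s)} = (x_1^{(s)}, x_2^{(s)}, \ldots, x_s^{(s)}) ,$$

and

$$Y^{(s)} = (y_1^{(s)}, y_2^{(s)}, \ldots, y_s^{(s)}) .$$

The $\sigma_1^{(s)}$, $P^{(s)} x_1^{(s)}$ and $Q^{(s)} y_1^{(s)}$ are
usually accurate approximations to the largest singular value, and the
corresponding left and right singular vectors, respectively, of $A$. We
may apply the Kaniel-Paige theory [19] to show that if $\theta \ge 0$ is
the angle between $q_1$ and $v_1$, then

$$\sigma_1 - \epsilon_1^2 \le \sigma_1^{(s)} \le \sigma_1 ,$$

where

$$\epsilon_1^2 = \frac{2\sigma_1 \tan^2\theta}{T_{s-1}^2\!\left( \dfrac{1 + \gamma_1}{1 - \gamma_1} \right)} ,$$

$T_{s-1}$ is the $(s-1)$-st Chebyshev polynomial of the first kind, and

$$\gamma_1 = \frac{\sigma_1 - \sigma_2}{2\sigma_1} .$$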

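We construct an example to show how $\sigma_1^{(s)}$ generally
approximates $\sigma_1$ well even for a small $s$. Let $\sigma_1 = 1.0$,
$\sigma_2 = 0.9$, $s = 20$ and $\theta = \cos^{-1} 0.1$. Then

$$\tan^2\theta = 99 , \qquad \gamma_1 = \frac{0.1}{2.0} = 0.05 ,$$

and

$$\frac{1 + \gamma_1}{1 - \gamma_1} = 1.105 , \qquad T_{19}(1.105) = 2.8 \times 10^3 .$$

Hence

$$\epsilon_1^2 = \frac{2 \cdot 1 \cdot 99}{(2.8 \times 10^3)^2} = 2.5 \times 10^{-5}$$

and

$$\sigma_1 - 0.000025 \le \sigma_1^{(s)} \le \sigma_1 .$$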
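
The arithmetic of this example can be reproduced with the identity T_k(x) = cosh(k arccosh x), valid for x >= 1. The sketch below uses only the Python standard library; it assumes the quantities of the example as read from the text (gamma_1 = (sigma_1 - sigma_2)/(2 sigma_1), numerator 2 sigma_1 tan^2 theta), and the helper name `chebyshev_T` is illustrative.

```python
import math

def chebyshev_T(k, x):
    """Chebyshev polynomial of the first kind T_k(x) for x >= 1."""
    return math.cosh(k * math.acosh(x))

sigma1, sigma2, s = 1.0, 0.9, 20
cos_theta = 0.1

tan2_theta = (1.0 - cos_theta**2) / cos_theta**2      # about 99
gamma1 = (sigma1 - sigma2) / (2.0 * sigma1)           # about 0.05
arg = (1.0 + gamma1) / (1.0 - gamma1)                 # about 1.105
T = chebyshev_T(s - 1, arg)                           # about 2.8e3
eps2 = 2.0 * sigma1 * tan2_theta / T**2               # about 2.5e-5
```

The rapid growth of the Chebyshev polynomial outside [-1, 1] is what makes the bound so small after only 20 steps, despite the poor starting vector.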


Since $n$ is usually very large, we often choose some $s \ll n$
subject to storage availability. If our convergence criterion for the
singular value is not satisfied, we may use $Q^{(s)} y_1^{(s)}$ as the new
initial vector and restart the Lanczos algorithm. Since the accuracy
of our approximation is bounded by $\tan\theta$, where $\theta$ is the
angle between our initial vector and $v_1$, we expect to obtain better
approximations if we iterate the Lanczos algorithm. If $z_i$ (or $w_i$)
$= 0$ for some $i$, we could continue the algorithm by choosing some $z_i$
(or $w_i$) orthogonal to all the previous $z_j$'s (or $w_j$'s), $j < i$. We
could also choose to terminate the algorithm because $z_i$ (or $w_i$) $= 0$
usually means some singular values have converged.

The sequences of vectors $\{p_i\}$ and $\{q_i\}$ form orthogonal sets
in exact arithmetic. Hence theoretically, we need only to keep the
most recent pairs of $\{p_i\}$ and $\{q_i\}$ in memory, providing great
savings in storage. Unfortunately, the sequences $\{p_i\}$, $\{q_i\}$
generally lose orthogonality very quickly due to cancellation errors
in the computations of the $z_i$'s and $w_i$'s. A remedy is to
reorthogonalize the most recently computed $q_i$ (or $p_i$) with respect
to all the previous $q_j$'s (or $p_j$'s), $j < i$. But this task is
expensive in both execution time and storage, because we must now store
all the computed $\{p_i\}$ and $\{q_i\}$ in memory. Paige [19] argues
against the necessity of reorthogonalization, but the matter is still a
subject of controversy.
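
Steps (1) and (2) of the standard algorithm, with optional full reorthogonalization, can be sketched as follows assuming NumPy. The function name `lanczos_bidiag`, the synthetic test matrix and the absence of any breakdown handling (a zero `z` or `w`) are simplifications of this example.

```python
import numpy as np

def lanczos_bidiag(A, q1, s, reorthogonalize=True):
    """s steps of Lanczos bidiagonalization, giving A Q = P J with J upper bidiagonal."""
    m, n = A.shape
    P, Q = np.zeros((m, s)), np.zeros((n, s))
    alpha, beta = np.zeros(s), np.zeros(s - 1)
    Q[:, 0] = q1 / np.linalg.norm(q1)
    w = A @ Q[:, 0]
    alpha[0] = np.linalg.norm(w)
    P[:, 0] = w / alpha[0]
    for i in range(s - 1):
        z = A.T @ P[:, i] - alpha[i] * Q[:, i]
        if reorthogonalize:                   # against all previous q_j
            z -= Q[:, :i + 1] @ (Q[:, :i + 1].T @ z)
        beta[i] = np.linalg.norm(z)
        Q[:, i + 1] = z / beta[i]
        w = A @ Q[:, i + 1] - beta[i] * P[:, i]
        if reorthogonalize:                   # against all previous p_j
            w -= P[:, :i + 1] @ (P[:, :i + 1].T @ w)
        alpha[i + 1] = np.linalg.norm(w)
        P[:, i + 1] = w / alpha[i + 1]
    J = np.diag(alpha) + np.diag(beta, k=1)
    return P, J, Q

# Synthetic 40 x 30 test matrix with known singular values (largest is 10).
rng = np.random.default_rng(4)
U0, _ = np.linalg.qr(rng.standard_normal((40, 30)))
V0, _ = np.linalg.qr(rng.standard_normal((30, 30)))
svals = np.concatenate(([10.0, 5.0], np.linspace(1.0, 0.1, 28)))
A = (U0 * svals) @ V0.T

P, J, Q = lanczos_bidiag(A, rng.standard_normal(30), s=12)
sigma_top = np.linalg.svd(J, compute_uv=False)[0]
```

Only products with `A` and `A.T` are needed, so `A` could equally be a sparse matrix or a routine that applies it; the dense test matrix is used here only so the result can be checked.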


B.  Block  Lanczos  algorithm.  In  many  cases  we  may  save  work 
if  we  iterate  with  a  block  of  vectors  instead  of  a  single  vector.  The 
saving  could  be  considerable  if  we  were  computing  a  multiple  singular 
value.  In  general,  if  we  had  some  a  priori  knowledge  of  the  singular 
value  spectrum,  we  could  choose  an  appropriate  block  size  with  good 
gains. Computer experiments (Golub, Luk and Overton [11]) show that
if  we  want  several  of  the  largest  singular  values,  we  often  gain  by 
choosing  a  block  size  p  >  1.  Also,  if  the  matrix  is  stored  on  an 
auxiliary  device,  we  may  make  some  gains  in  efficiency  if  we  multiply 
the  matrix  into  several  vectors  simultaneously. 

In a similar way to the standard Lanczos algorithm, our block
version reduces the matrix $A$ to a block bidiagonal form. We start
with an arbitrary $n \times p$ matrix $Q_1$ such that $Q_1^T Q_1 = I_p$
and perform a QR factorization of the product $AQ_1$:

$$P_1 A_1 = A Q_1 ,$$

where $P_1$ is an $m \times p$ matrix such that $P_1^T P_1 = I_p$,
and $A_1$ is a $p \times p$ upper triangular matrix.

Our algorithm continues with

$$Q_i B_{i-1} = A^T P_{i-1} - Q_{i-1} A_{i-1}^T$$

and

$$P_i A_i = A Q_i - P_{i-1} B_{i-1}^T , \qquad i = 2, \ldots, s ,$$

where $Q_i B_{i-1}$ and $P_i A_i$ are the QR factorizations of the
respective right-hand sides, and

$Q_i$ is an $n \times p$ matrix such that $Q_i^T Q_i = I_p$,

$P_i$ is an $m \times p$ matrix such that $P_i^T P_i = I_p$,

and both $A_i$ and $B_i$ are $p \times p$ upper triangular matrices.

We have tacitly assumed $p \times s \le n$. We consider the $ps \times ps$
block bidiagonal matrix

$$J^{(s)} = \begin{pmatrix} A_1 & B_1^T & & \\ & A_2 & \ddots & \\ & & \ddots & B_{s-1}^T \\ & & & A_s \end{pmatrix} ,$$

which is also banded upper triangular with bandwidth $= p+1$.

We can reduce $J^{(s)}$ to bidiagonal form using the Householder
transformations. We can also use plane rotations to reduce $J^{(s)}$ to
bidiagonal form to take advantage of the sparse banded structure of
$J^{(s)}$.


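A plane rotation $P_{ij}$ in the $(i,j)$-plane is an orthogonal matrix
that agrees with the identity except for the elements in positions
$(i,i)$, $(i,j)$, $(j,i)$ and $(j,j)$, which form the $2 \times 2$ block

$$\begin{pmatrix} \gamma & \sigma \\ -\sigma & \gamma \end{pmatrix} ,$$

where $\gamma^2 + \sigma^2 = 1$. It is easy to verify that given a vector
$x$ we can choose $\gamma$ and $\sigma$ such that $P_{ij}$ annihilates
the $j$-th component of $x$. We give a simple example to demonstrate how
we can reduce $J^{(s)}$ using plane rotations.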

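Suppose we have the following $6 \times 6$ matrix (here $\times$ marks a
nonzero element, $+$ a newly created nonzero element and $0$ an element
that has just been annihilated):

$$A = \begin{pmatrix} \times & \times & \times & & & \\ & \times & \times & \times & & \\ & & \times & \times & \times & \\ & & & \times & \times & \times \\ & & & & \times & \times \\ & & & & & \times \end{pmatrix} .$$

We construct a plane rotation $P_{23}$, postmultiplying $A$ to annihilate
the (1,3) element. The rotation creates a non-zero element in the (3,2)
position, i.e.,

$$AP_{23} = \begin{pmatrix} \times & \times & 0 & & & \\ & \times & \times & \times & & \\ & + & \times & \times & \times & \\ & & & \times & \times & \times \\ & & & & \times & \times \\ & & & & & \times \end{pmatrix} .$$

Now we apply a plane rotation $P'_{23}$, premultiplying $AP_{23}$ to
eliminate the (3,2) element. A new non-zero element appears in the (2,5)
position:

$$P'_{23}AP_{23} = \begin{pmatrix} \times & \times & & & & \\ & \times & \times & \times & + & \\ & 0 & \times & \times & \times & \\ & & & \times & \times & \times \\ & & & & \times & \times \\ & & & & & \times \end{pmatrix} .$$

We construct $P_{45}$ to annihilate the new nonzero (2,5) element from the
right:

$$P'_{23}AP_{23}P_{45} = \begin{pmatrix} \times & \times & & & & \\ & \times & \times & \times & 0 & \\ & & \times & \times & \times & \\ & & & \times & \times & \times \\ & & & + & \times & \times \\ & & & & & \times \end{pmatrix} .$$

An appropriate plane rotation from the left will annihilate the
newly created (5,4) element without creating new nonzero elements:

$$P'_{45}P'_{23}AP_{23}P_{45} = \begin{pmatrix} \times & \times & 0 & & & \\ & \times & \times & \times & & \\ & & \times & \times & \times & \\ & & & \times & \times & \times \\ & & & & \times & \times \\ & & & & & \times \end{pmatrix} .$$

We say we have "chased" away the (1,3) element of $A$ (cf. Rutishauser
[20]).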
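
The chase described above can be traced numerically. The sketch below assumes NumPy and uses 0-based indices, so the paper's element (1,3) is `B[0, 2]`. The helper names `right_rotation` and `left_rotation` and the sign convention of the rotation are choices made for this example.

```python
import numpy as np

def right_rotation(A, row, i, j):
    """Rotate columns i and j of A in place so that A[row, j] becomes zero."""
    a, b = A[row, i], A[row, j]
    h = np.hypot(a, b)
    if h == 0.0:
        return
    c, s = a / h, b / h
    col_i, col_j = A[:, i].copy(), A[:, j].copy()
    A[:, i] = c * col_i + s * col_j
    A[:, j] = -s * col_i + c * col_j

def left_rotation(A, col, i, j):
    """Rotate rows i and j of A in place so that A[j, col] becomes zero."""
    right_rotation(A.T, col, i, j)           # A.T is a view, so A itself is updated

rng = np.random.default_rng(5)
M = rng.uniform(1.0, 2.0, (6, 6))
A = np.triu(M) - np.triu(M, 3)               # banded upper triangular, bandwidth 3

B = A.copy()
right_rotation(B, 0, 1, 2)   # P_23:  annihilate (1,3), creating (3,2)
left_rotation(B, 1, 1, 2)    # P'_23: annihilate (3,2), creating (2,5)
right_rotation(B, 1, 3, 4)   # P_45:  annihilate (2,5), creating (5,4)
left_rotation(B, 3, 3, 4)    # P'_45: annihilate (5,4), no new fill
```

Each rotation is orthogonal, so `B` keeps the singular values of `A` while its first row has been reduced to bidiagonal shape. Repeating the chase for the remaining rows completes the reduction.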

We may determine the singular value decomposition of the
resultant bidiagonal matrix using the QR method. Using a theorem due
to Underwood [22], we can show that the $p$ largest singular values
of $J^{(s)}$ are usually accurate approximations to the $p$ largest
singular values of $A$. In fact, if $\cos\theta > 0$ is the smallest
singular value of $Q_1^T V_1$, where $V_1$ consists of the first $p$
columns of $V$, then for $k = 1, 2, \ldots, p$,

$$\sigma_k - \epsilon_k^2 \le \sigma_k^{(s)} \le \sigma_k ,$$

where

$$\epsilon_k^2 = \frac{(\sigma_1 + \sigma_k) \tan^2\theta}{T_{s-1}^2\!\left( \dfrac{1 + \gamma_k}{1 - \gamma_k} \right)} ,$$

$$\gamma_k = \frac{\sigma_k - \sigma_{p+1}}{\sigma_k + \sigma_1} ,$$

and $T_{s-1}$ is the $(s-1)$-st Chebyshev polynomial of the first kind.

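We consider an example similar to the one we have given in
the previous section. Let $\sigma_1 = 1.0$, $\sigma_2 = 0.9$,
$\sigma_3 = 0.5$, $p = 2$, $s = 10$ and $\theta = \cos^{-1} 0.1$. Then

$$\tan^2\theta = 99 ,$$

$$\gamma_1 = \frac{\sigma_1 - \sigma_3}{\sigma_1 + \sigma_1} = \frac{0.5}{2.0} = 0.25 ,$$

$$\gamma_2 = \frac{\sigma_2 - \sigma_3}{\sigma_2 + \sigma_1} = \frac{0.4}{1.9} = 0.21 ,$$

$$\frac{1 + \gamma_1}{1 - \gamma_1} = \frac{1.25}{0.75} = 1.67 , \qquad \frac{1 + \gamma_2}{1 - \gamma_2} = \frac{1.21}{0.79} = 1.53 .$$

Hence

$$T_9(1.67) = 10^4 ,$$

$$T_9(1.53) = 3.7 \times 10^3 ,$$

$$\epsilon_1^2 = \frac{2 \times 99}{10^8} \doteq 2.0 \times 10^{-6} ,$$

and

$$\epsilon_2^2 = \frac{1.9 \times 99}{(3.7 \times 10^3)^2} \doteq 1.4 \times 10^{-5} .$$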


Comparing the two examples, we can see how a proper choice
of the block size would save us work with the same limitation on
storage space. In general, a good choice of $p$ depends on the singular
value spectrum, the number of singular values desired, and the
availability of memory. If there is a cluster of $\hat{p}$ largest
singular values, it usually pays to choose $p = \hat{p}$. Often, the
knowledge is not available and a satisfactory rule appears to be
choosing $p$ equal to the number of singular values we want to compute.
Our tests [11] show that the reorthogonalization of each recently
computed $P_i$ ($Q_i$) with respect to all the previous $P_j$'s
($Q_j$'s), $j < i$, is necessary for accurate results. We therefore must
keep all the $P_i$'s and $Q_i$'s in memory, effectively bounding the
value $p \times s$.

An algorithm will soon be published [11].